The bounded confidence model of opinion dynamics, introduced by Deffuant et al., is a
stochastic model for the evolution of continuous-valued opinions within a finite group of
peers. We prove that, as time goes to infinity, the opinions evolve globally into a random
set of clusters too far apart to interact, and thereafter all opinions in every cluster
converge to their barycenter. We then prove a mean-field limit result, propagation of
chaos: as the number of peers goes to infinity in adequately started systems and time
is rescaled accordingly, the opinion processes converge to i.i.d. nonlinear Markov (or
McKean-Vlasov) processes; each limit opinion process evolves as if under the influence
of opinions drawn from its own instantaneous law, which is the unique solution of
a nonlinear integro-differential equation of Kac type. This implies that the (random)
empirical distribution process converges to this (deterministic) solution. We then prove
that, as time goes to infinity, this solution converges to a law concentrated on isolated
opinions too far apart to interact, and identify sufficient conditions for the limit not
to depend on the initial condition, and to be concentrated at a single opinion. Finally,
we prove that if the equation has an initial condition with a density, then its solution
has a density at all times, develop a numerical scheme for the corresponding functional
equation, and show numerically that bifurcations may occur.

Keywords: Social networks; reputation; opinion; mean-field limit; propagation of chaos; 
nonlinear integro-differential equation; kinetic equation; numerical experiments. 
1. Introduction

Some models of opinion dynamics (or belief or gossip propagation, etc.) are
based on binary values [1, 3, 35, 27, 42, 37] and often lead to attractors that display
uniformity of opinions. These models are not valid for scenarios such as the social
network of truck drivers interested in the quality of food of a highway restaurant, or
the critics' ratings of newly released movies, for which a continuous spectrum of
opinions is required, as is also the case in politics when people are
positioned on a scale going from extreme left-wing to right-wing opinions.

*J. Gomez-Serrano's research was done while he was an exchange student at EPFL.

Gomez-Serrano, Graham and Le Boudec

The bounded confidence model introduced by Deffuant et al. is a popular
model for such scenarios. Peers have [0, 1]-valued opinions; repeatedly in discrete
steps, two peers are sampled, and if their opinions differ by at most a deviation
threshold then both move closer, in barycentric fashion governed by a confidence
factor. These parameters are the same for all peers, and the system is in binary
mean-field interaction. The model has been studied and generalized, notably to
interaction graphs other than the fully-connected one [44, 41, 43, 14, 24], to vector-valued
opinions [34, 44], and to peer-dependent deviation thresholds.

Reputation systems have lately emerged due to the necessity to measure trust
in users during transactions over the internet; popular examples can be
found in eBay or Bizrate. Some models for trust evolution and the potential
effects of groups of liars attacking the system can be seen as a generalization of
the bounded confidence model, in particular when there are neither liars nor direct
observations and the system evolves only by interaction between the peers. The
"Rendez-vous" model used by Blondel et al. has qualitative resemblance to the
model used in this paper; like ours, it converges to a finite number of clusters in
finite time for the finite N case. However, the interaction model is different, and
our techniques (based on convexity and conservation of mean, see Proposition 3.2)
do not seem to apply to this model.

The mean-field approximation method for large interacting systems has a very
long history. Its heuristic and rigorous use started in statistical physics [22, 31, 6] and
entered many other fields, notably communication networks [45, 25, 20, 21, 18, 19], TCP
connections [5, 39, 4], robot swarms, transportation systems, and online reputation
systems [8, 33, 32], in which it is particularly appealing since the number of users may be
very large (over 400 million for Facebook).

This paper provides some rigorous proofs of old and new results on the Deffuant
et al. model, which has been studied intensively, but essentially by heuristic
arguments and simulations. Notably, justifying the validity of the mean-field approach
is not a simple matter, and classical methods do not apply, as seen below.

We prove that as time increases to infinity, opinions eventually group after some
random finite time into a constant number of clusters, which are separated by more
than the deviation threshold, and cannot influence one another. Thereafter, all
opinions within every such cluster converge to their barycenter. The limit distribution
of opinions is thus of a degenerate form, in which there are only a small number of
fixed opinions which differ too much to influence each other, called a "partial
consensus"; when it is constituted of one single opinion, it is called a "total consensus".
Note that the limit distribution is itself random, i.e. different sample runs of the
same model with same initial conditions always converge, but perhaps to different
limiting distributions of opinions.

We then prove a mean-field limit result, called "propagation of chaos" in
statistical mechanics: if the number of peers goes to infinity, the systems are adequately
started, and time is rescaled accordingly, then the processes of the opinions
converge in law to i.i.d. processes. Each of these is a so-called nonlinear Markov (or
McKean-Vlasov) process, corresponding to an opinion evolving under the influence
of opinions drawn independently from the marginal law of the opinion process itself,
at a rate which is the limit of that at which a given peer in the finite system
encounters its peers. Moreover, these marginals are the unique solution of an adequately
started nonlinear integro-differential equation.

This implies a law of large numbers: the empirical measures of the interacting 
processes converge to the law of the nonlinear Markov process. Such process level 
results imply results for the marginal laws, but they are much stronger: limits are 
derived for functionals of the sample paths, such as hitting times or extrema. In 
particular, a functional law of large numbers holds for the marginal processes of 
these empirical measures, with limit the solution of the integro-differential equation. 

The probabilistic structure of this limit equation is similar to that of kinetic 
equations such as the cutoff spatially-homogeneous Boltzmann or Kac equations, 
classically used in statistical mechanics to describe the limit of certain particle 
systems with binary interaction. Under quite general assumptions, satisfied here, it 
has long been known that it is well-posed in the space of probability laws, and that 
if the initial law has a density, then the solution has a density at all times satisfying 
a functional formulation of this equation. 

Remark 1.1. There are two main difficulties in the propagation of chaos proof: 

(1) the interaction is binary mean-field, since two opinions change simultaneously, 

(2) the indicator functions related to the deviation threshold are discontinuous. 

A system in which only one opinion would change at a time would be in simple
mean-field interaction, and one could write equations for the opinions in almost closed
form, which could be passed to the limit in various classical ways. This cannot be
done for binary interaction, in which there is much more feedback between peers;
moreover, this would require continuous coefficients. See Section 4.2 for details.



Such difficulties have been solved before. In order to adapt results obtained
for a class of interacting systems inspired by communication network models
using stochastic coupling techniques, which can be applied to various Boltzmann
and Kac models, we introduce an intermediate auxiliary system, a continuous-time
variant of the discrete-time model of Deffuant et al. interacting at Poisson
instants, which itself constitutes a relevant opinion model.

For this auxiliary system, we prove propagation of chaos, in total variation norm 
with estimates on any finite time interval. We then control the distance between this 
auxiliary system and the Deffuant et al. model, and prove propagation of chaos for a 
weaker topology, but still at the process level and allowing discontinuous measurable 
dependence on the [0, 1]-values taken by the opinions.

The method can be easily generalized, for instance to vector-valued opinions, or
to randomized interactions with a joint law governing whether one or both peers 
change opinion and by how much; for instance, choosing uniformly at random one 
peer to change opinion and leaving the other fixed would lead to a simple mean-field 
interacting model, and the limit model would be slowed down by a factor two. 

To the best of our knowledge, this is the first rigorous mean-field limit result for
this model. Similar integro-differential equations were used without formal
justification before [12, 29] and appear to be incorrect by a factor 2 (perhaps by disregarding
that two peers change opinion at once), which illustrates the interest of deriving
the macroscopic equation from a microscopic description, as we do here.

We thank a referee for bringing to our attention the preprint of
Como-Fagnani [13]. It contains results for the marginal laws of a continuous-time variant
of the model with two major simplifications: the interaction is simple mean-field
(only one opinion changes at a time), and the indicator functions are replaced by
Lipschitz-continuous functions; this removes difficulties (1) and (2) in Remark 1.1,
to which its techniques do not apply. We have overcome these difficulties in the
precise model of Deffuant et al., and have given much stronger results, for process
laws in total variation norm and not for marginal laws in weak topologies.

One expects that the long-time behavior for the mean-field limit should be
closely related to the behavior for a large number of peers of the long-time limit
of the finite model. This heuristic inversion of long-time and large-number limits
can sometimes be rigorously justified, for instance by a compactness-uniqueness
method [45, 25, 18, 19], but here the limit nonlinear integro-differential equation may have
multiple equilibria, and a formal proof would constitute a formidable task.

We prove that the long-time behavior of the solution of the limit integro- 
differential equation is similar to that of the model with finitely many peers: it 
converges to a partial consensus constituted of a small number of fixed opinions 
which differ too much to influence each other. 

We then develop a numerical method for the limit equation, and use it to
explore the properties of the model. We observe phase transitions with respect to
the number of limit opinions, while varying the deviation threshold for some fixed 
initial condition. We model the scenario of a company fusion, dividing the workers 
into an "undecided" group and two "extremist" factions, and obtain that having 
20% of the workers "undecided" is enough to achieve consensus between all. 

Last, we establish a bound on the deviation threshold that allows one to determine
whether there is total consensus, under the assumption of symmetric initial conditions.

In the sequel, Section 2 describes the finite model, and Section 3 studies some of
its long time properties. Section 4 rigorously derives the mean-field limit, Section 5
studies some of its long time properties, and Section 6 is devoted to numerical
results. The appendix contains some probabilistic complements in Section A, the
details of the algorithm in Section B and all proofs in Section C.
2. Interacting system model, and reduced descriptions

The model for $N \ge 2$ interacting peers introduced by Deffuant et al. is as follows.
The random variable (r.v.) $X_i^N(k)$ with values in $[0, 1]$ denotes the reputation record
kept at peer $i \in \{1, \dots, N\}$ at time $k \in \mathbb{N} = \{0, 1, \dots\}$, representing its opinion (or
belief, etc.) about a given subject, the same for all peers. The discrete-time process
of the states taken by the system of peers is

$$X^N = (X^N(k), k \in \mathbb{N}), \qquad X^N(k) = (X_i^N(k))_{1 \le i \le N},$$

and evolves as a function of the deviation threshold $\Delta \in (0, 1]$ and the confidence
factor $w \in (0, 1)$. At each instant $k$, two peers $i$ and $j$ are selected uniformly at
random without replacement, and:

• if $|X_i^N(k) - X_j^N(k)| > \Delta$ then $X^N(k+1) = X^N(k)$, the two peers' opinions
being too different for mutual influence,

• if $|X_i^N(k) - X_j^N(k)| \le \Delta$ then the values of peers $i$ and $j$ are updated to

$$X_i^N(k+1) = w X_i^N(k) + (1-w) X_j^N(k), \qquad X_j^N(k+1) = w X_j^N(k) + (1-w) X_i^N(k),$$

and the values of the other peers do not change at time $k+1$, the two peers
having sufficiently close opinions to influence each other.

Small values of the deviation threshold $\Delta$ and large values of $w$ mean that the peers trust
their own opinions much more than the new information given by the other interacting
peer. The excluded extreme values $\Delta = 0$ or $w = 1$ correspond to peers never
changing opinion, and $w = 0$ to peers switching opinions if close enough. For $w = 1/2$,
two close-enough peers would both end up with the average of their opinions.
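
The update rule of the finite model can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' code: the names `deffuant_step` and `simulate`, and the choice of i.i.d. uniform initial opinions, are assumptions made for the example.

```python
import random


def deffuant_step(x, delta, w, rng):
    """Apply one step of the discrete-time model to the opinion list x (in place).

    delta is the deviation threshold and w the confidence factor.
    Returns the pair (i, j) of sampled peers.
    """
    i, j = rng.sample(range(len(x)), 2)  # two distinct peers, uniformly at random
    if abs(x[i] - x[j]) <= delta:
        xi, xj = x[i], x[j]
        x[i] = w * xi + (1 - w) * xj
        x[j] = w * xj + (1 - w) * xi
    return i, j


def simulate(n_peers, delta, w, n_steps, seed=0):
    """Run n_steps of the model from i.i.d. uniform opinions (an assumed start)."""
    rng = random.Random(seed)
    x = [rng.random() for _ in range(n_peers)]
    for _ in range(n_steps):
        deffuant_step(x, delta, w, rng)
    return x
```

For instance, `simulate(50, 0.2, 0.5, 20000)` returns the 50 opinions after 20000 steps; sorting the result typically shows a few groups of nearly equal values.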

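A reduced, or macroscopic, description of the system is given by the empirical
measure $\Lambda^N$, and by its marginal process $M^N = (M^N(k), k \in \mathbb{N})$, also called the
occupancy process,

$$\Lambda^N = \frac{1}{N} \sum_{i=1}^N \delta_{X_i^N} = \frac{1}{N} \sum_{i=1}^N \delta_{(X_i^N(k), k \in \mathbb{N})}, \qquad M^N(k) = \frac{1}{N} \sum_{i=1}^N \delta_{X_i^N(k)}.$$

The random measure $\Lambda^N$ has samples in $\mathcal{P}([0, 1]^{\mathbb{N}})$, the space of probability
measures on $[0, 1]^{\mathbb{N}}$; its projection $M^N = (M^N(k), k \in \mathbb{N})$, which carries much less
information, has sample paths in $\mathcal{P}([0, 1])^{\mathbb{N}}$, the space of sequences of probability
measures on $[0, 1]$. For measurable $g : [0, 1]^{\mathbb{N}} \to \mathbb{R}$ and $h : [0, 1] \to \mathbb{R}$,

$$\langle g, \Lambda^N \rangle = \frac{1}{N} \sum_{i=1}^N g(X_i^N), \qquad \langle h, M^N(k) \rangle = \frac{1}{N} \sum_{i=1}^N h(X_i^N(k)).$$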

We will also rescale time as $t = k/N$, and consider in particular the rescaled
occupancy process $\tilde M^N = (\tilde M^N(t), t \in \mathbb{R}_+)$ given by $\tilde M^N(t) = M^N(\lfloor Nt \rfloor)$, which
in Section 4 will be shown to converge to a deterministic process $(m(t), t \in \mathbb{R}_+)$.

3. Long-time behavior of the finite N model

We consider a fixed finite number of peers and let time $k$ go to infinity. We prove
that the distribution of peer opinions $M^N(k)$ converges almost surely (a.s.) to a
random distribution $M^N(\infty)$. Note that the limiting distribution $M^N(\infty)$ depends
on chance as well as on the initial condition. We prove that $M^N(\infty)$ is a combination
of at most $\lceil 1/\Delta \rceil$ Dirac measures at points separated by more than the deviation threshold $\Delta$. A key observation
here is that if $h$ is any convex function then $\langle h, M^N(k) \rangle$ is non-increasing in $k$.
Dittmer and Krause obtained similar results, but for a deterministic model.

Definition 3.1. We say that $\nu \in \mathcal{P}[0, 1]$ is a partial consensus with $c$ components
if $\nu = \sum_{m=1}^c \alpha_m \delta_{x_m}$ with $x_m \in [0, 1]$, $|x_m - x_{m'}| > \Delta$ for $m \ne m'$, and $\alpha_m > 0$.
Necessarily $c \le \lceil 1/\Delta \rceil$ and $\sum_{m=1}^c \alpha_m = 1$. If $c = 1$, i.e., if $\nu$ is a Dirac measure, we
say that $\nu$ is a total consensus.
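
Definition 3.1 translates directly into a test on a finite list of opinions. The sketch below is only illustrative: the function names and the tolerance `tol` used to decide that two floating-point opinions coincide are assumptions of the example, not part of the definition.

```python
import math


def partial_consensus_components(opinions, delta, tol=1e-12):
    """Return the sorted distinct opinion values if the occupancy measure of
    `opinions` is a partial consensus for threshold delta, else None.

    Values closer than tol are treated as equal (a numerical assumption).
    """
    values = []
    for v in sorted(opinions):
        if not values or v - values[-1] > tol:
            values.append(v)
    # All distinct values must be separated by more than delta.
    for a, b in zip(values, values[1:]):
        if b - a <= delta:
            return None
    return values


def max_components(delta):
    """Upper bound ceil(1/delta) on the number of components."""
    return math.ceil(1 / delta)
```

A total consensus corresponds to a returned list of length one.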

If $M^N(k)$ is a partial consensus, then peers are grouped in a number of
components too far apart to interact, and within one component all peers have the same
value. Thus, a partial consensus is an absorbing state for $M^N$, and Theorem 3.9
below will show that $M^N(k)$ converges a.s., as $k \to \infty$, to one such state.



3.1. Convexity and Moments 

We start with results about convexity and moments, which are needed to establish 
the convergence result and are also of independent interest. 

Proposition 3.2. For any convex function $h : [0, 1] \to \mathbb{R}$, any $x$, $y$ and $w$ in $[0, 1]$,

$$h(wx + (1 - w)y) + h(wy + (1 - w)x) - h(x) - h(y) \le 0$$

with equality when $h$ is strictly convex possible only if $x = y$ or $w = 0$ or $w = 1$.
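
As a sanity check of Proposition 3.2, note that for $h(x) = x^2$ the left-hand side equals $-2w(1-w)(x-y)^2$, which is non-positive and vanishes exactly when $x = y$, $w = 0$ or $w = 1$. The following sketch evaluates the left-hand side on random inputs; the names `interaction_gain` and `max_gain` are hypothetical, and random sampling is of course a check and not a proof.

```python
import math
import random


def interaction_gain(h, x, y, w):
    """Left-hand side of the inequality in Proposition 3.2."""
    return h(w * x + (1 - w) * y) + h(w * y + (1 - w) * x) - h(x) - h(y)


def max_gain(h, n_samples=10000, seed=0):
    """Largest left-hand side observed over random (x, y, w) in [0, 1]^3."""
    rng = random.Random(seed)
    return max(
        interaction_gain(h, rng.random(), rng.random(), rng.random())
        for _ in range(n_samples)
    )
```

For a convex `h` the value of `max_gain(h)` should be non-positive up to rounding, while a strictly concave function such as `math.sqrt` gives a positive value.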

The following corollary is immediate from the interaction structure of the model. 

Corollary 3.3. If $h : [0, 1] \to \mathbb{R}$ is a convex function, then $\langle h, M^N(k) \rangle$ is a
non-increasing function of $k$ along any sample path.

Applying this to $h(x) = x$, $h(x) = -x$ and $h(x) = x^n$ yields that in any sample
path, the first moment is constant and other moments are non-increasing with time.

Corollary 3.4. For $n = 1, 2, \dots$ and $k \in \mathbb{N}$, let $\mu_n^N(k) = \frac{1}{N} \sum_{i=1}^N X_i^N(k)^n$
denote the $n$-th moment of $M^N(k)$, and let $\sigma^N(k)$ be the standard deviation given by
$\sigma^N(k)^2 = \mu_2^N(k) - \mu_1^N(k)^2$. Then:

(1) The mean $\mu_1^N(k)$ is stationary in $k$, i.e., $\mu_1^N(k) = \mu_1^N(0)$ for all $k$.

(2) The moments and the standard deviation are non-increasing in $k$: if $k \le k'$
then $\mu_n^N(k) \ge \mu_n^N(k')$ and $\sigma^N(k) \ge \sigma^N(k')$.
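
Corollary 3.4 can be observed on a simulated sample path. The sketch below records the first two moments after every step; the function names, the uniform initial opinions and the parameter values are assumptions of the example.

```python
import random


def moment(x, n):
    """n-th moment of the occupancy measure of the opinion list x."""
    return sum(v ** n for v in x) / len(x)


def moment_history(n_peers, delta, w, n_steps, seed=0):
    """Simulate the finite model and return [(first moment, second moment), ...]."""
    rng = random.Random(seed)
    x = [rng.random() for _ in range(n_peers)]  # assumed uniform i.i.d. start
    history = [(moment(x, 1), moment(x, 2))]
    for _ in range(n_steps):
        i, j = rng.sample(range(n_peers), 2)
        if abs(x[i] - x[j]) <= delta:
            # Right-hand side is evaluated with the old values of both peers.
            x[i], x[j] = w * x[i] + (1 - w) * x[j], w * x[j] + (1 - w) * x[i]
        history.append((moment(x, 1), moment(x, 2)))
    return history
```

Up to floating-point rounding, the first component of the history stays constant and the second never increases.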
Moreover, stationarity of moments is equivalent to reaching partial consensus:

Proposition 3.5. If $M^N(k)$ is a partial consensus, then $\mu_n^N(k') = \mu_n^N(k)$ for all
$n \ge 1$ and $k' \ge k$. Conversely, if for some $n \ge 2$ there exists a (random) instant $k$
such that $\mu_n^N(k') = \mu_n^N(k)$ for all $k' \ge k$, then $M^N(k') = M^N(k)$ for all $k' \ge k$ and
$M^N(k)$ is a partial consensus, almost surely.



3.2. Almost Sure Convergence to Partial Consensus 

Definition 3.6. We say that two peers $i$ and $j$ are connected at time $k$ if their
values $x$ and $y$ satisfy $|y - x| \le \Delta$. We say that $F \subset \{1, 2, \dots, N\}$ is a cluster at
time $k$ if it is a maximal connected component.

In other words, a cluster is a maximal set of peers such that every peer can pass 
the deviation test with one neighbour in the cluster. The set of clusters at time k 
is a random partition of the set of peers. The following proposition states that a 
cluster can either split or stay constant, but cannot grow. 
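
Since opinions are real numbers, the clusters can be computed by sorting: two consecutive sorted opinions belong to the same cluster exactly when their gap is at most the deviation threshold. The following sketch (with the hypothetical name `clusters`) returns the clusters as lists of peer indices.

```python
def clusters(opinions, delta):
    """Partition peer indices into clusters, i.e. connected components of the
    graph linking peers whose opinions differ by at most delta."""
    order = sorted(range(len(opinions)), key=lambda i: opinions[i])
    result = []
    for idx in order:
        # Compare with the largest opinion of the current cluster.
        if result and opinions[idx] - opinions[result[-1][-1]] <= delta:
            result[-1].append(idx)
        else:
            result.append([idx])
    return result
```

For example, with opinions `[0.1, 0.9, 0.2, 0.35]` and threshold `0.2`, peers 0 and 3 are not directly connected but belong to the same cluster through peer 2.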

Proposition 3.7. Let $\mathcal{C}^N(k) = \{C_1, \dots, C_\ell\}$ be the set of clusters at time $k$. Then
either $\mathcal{C}^N(k+1) = \mathcal{C}^N(k)$ or $\mathcal{C}^N(k+1) = (\mathcal{C}^N(k) \setminus \{C_{\ell_1}\}) \cup \mathcal{C}'$ where $\mathcal{C}'$ is a partition
of $C_{\ell_1}$, for some $\ell_1 \in \{1, \dots, \ell\}$.

The number of clusters is thus non-decreasing, and since it is bounded by $\lceil 1/\Delta \rceil$
it must be constant after some time, yielding the following:

Corollary 3.8. There exists a random time $K^N$, a.s. finite, such that

$$\mathcal{C}^N(k) = \mathcal{C}^N(K^N) \quad \text{for } k \ge K^N.$$

Finally, we prove that the occupancy measure converges to a partial consensus
(see Section A for the usual weak topology on $\mathcal{P}[0, 1]$):

Theorem 3.9. As $k$ goes to infinity, $M^N(k)$ converges almost surely, for the weak
topology on $\mathcal{P}[0, 1]$, to a random probability $M^N(\infty)$, which is a partial consensus
with $L^N$ components, where $L^N := \mathrm{Card}(\mathcal{C}^N(K^N))$ is the final number of clusters.

Theorem 3.9 notably implies that there is convergence to total consensus if and
only if $L^N = 1$. The probability $p^* := \mathbf{P}(L^N = 1)$ of convergence to total consensus
is not necessarily 0 or 1, but:

(1) If the diameter of $M^N(0)$ is less than $\Delta$ (i.e., $\max_{i,j} |X_i^N(0) - X_j^N(0)| < \Delta$)
then $p^* = 1$ (obvious);

(2) If there is more than 1 cluster in $M^N(0)$ then $p^* = 0$ (see Proposition 3.7).



4. Mean-field limit results when N goes to infinity 

This section is devoted to a rigorous statement for the following heuristic statistical
mechanics limit picture: all peers act independently, as if each were influenced by an
infinite supply of independent statistically similar peers, whose instantaneous
laws solve a nonlinear equation obtained by consistency from this feedback.

In statistical mechanics and probability theory, such convergence to an i.i.d.
system is called chaoticity, and the fact that chaoticity at time 0 implies chaoticity
at further times is called propagation of chaos.

Some other probabilistic definitions and facts are recalled in Section A.

We introduce an intermediate auxiliary system, in which the peers meet at the
instants of a Poisson process, which itself constitutes a relevant opinion model.
We adapt results of Graham-Méléard to prove that the sample paths of the
auxiliary system are well approximated by the limit system. We eventually control
the distance between the auxiliary system and the model of Deffuant et al.



4.1. Mean-field regime, rescaled and auxiliary systems 

The number $N$ of peers is typically large, and we let it go to infinity. At each
time-step two peers are possibly updated, and the empirical measures have jumps of
order $1/N$, hence time must be rescaled by a factor $N$. This is a mean-field limit, in
which time is usually rescaled by physical considerations (such as "the peers meet
in proportion to their numbers"). It is also related to fluid limits.

For a Polish space $S$, let $\mathcal{P}(S)$ denote the space of probability measures on $S$,
with the Borel $\sigma$-field, and $D(\mathbb{R}_+, S)$ denote the Skorohod space of paths from $\mathbb{R}_+$
to $S$ which are right-continuous with left-hand limits, with the product $\sigma$-field.

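A non-trivial continuous-time limit process is expected for the rescaled system

$$\tilde X^N = (\tilde X_i^N)_{1 \le i \le N}, \qquad \tilde X^N = (\tilde X^N(t), t \in \mathbb{R}_+) = (X^N(\lfloor Nt \rfloor), t \in \mathbb{R}_+), \tag{4.1}$$

with sample paths in $D(\mathbb{R}_+, [0, 1]^N)$. The corresponding empirical measure $\tilde\Lambda^N$ and
the process $\tilde M^N = (\tilde M^N(t), t \in \mathbb{R}_+)$ constituted of its marginal laws are given by

$$\tilde\Lambda^N = \frac{1}{N} \sum_{i=1}^N \delta_{\tilde X_i^N} = \frac{1}{N} \sum_{i=1}^N \delta_{(\tilde X_i^N(t), t \in \mathbb{R}_+)}, \qquad \tilde M^N(t) = \frac{1}{N} \sum_{i=1}^N \delta_{\tilde X_i^N(t)}, \tag{4.2}$$

respectively with samples in $\mathcal{P}(D(\mathbb{R}_+, [0, 1]))$ and sample paths in $D(\mathbb{R}_+, \mathcal{P}[0, 1])$.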

An auxiliary (rescaled) system is obtained by randomizing the jump instants of
the original model by waiting i.i.d. exponential r.v. of mean $1/N$ between selections,
instead of deterministic durations. A convenient construction using a Poisson
process $(A(t), t \in \mathbb{R}_+)$ of intensity 1 is that, with sample spaces as above,

$$\hat X^N = (\hat X_i^N)_{1 \le i \le N}, \qquad \hat X^N = (\hat X^N(t), t \in \mathbb{R}_+) = (X^N(A(Nt)), t \in \mathbb{R}_+),$$

$$\hat\Lambda^N = \frac{1}{N} \sum_{i=1}^N \delta_{\hat X_i^N}, \qquad \hat M^N = (\hat M^N(t), t \in \mathbb{R}_+), \qquad \hat M^N(t) = \frac{1}{N} \sum_{i=1}^N \delta_{\hat X_i^N(t)}. \tag{4.3}$$

If $T_k$ for $k \ge 0$ are given by $T_0 = 0$ and the jump instants of $(A(t), t \in \mathbb{R}_+)$, then

$$\tilde X^N(t) = \hat X^N(t') = X^N(k), \qquad \frac{k}{N} \le t < \frac{k+1}{N}, \qquad \frac{T_k}{N} \le t' < \frac{T_{k+1}}{N}. \tag{4.4}$$
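
The difference between the rescaled and the auxiliary systems lies only in the instants at which the discrete-time chain advances: the $k$-th update occurs at time $k/N$ in the rescaled system and at time $T_k/N$ in the auxiliary one, where $T_k$ is the $k$-th jump instant of a Poisson process of intensity 1. The sketch below generates both sequences of instants; the function names are hypothetical.

```python
import random


def rescaled_jump_times(n_peers, n_jumps):
    """Deterministic update instants k/N, k = 1..n_jumps, of the rescaled system."""
    return [k / n_peers for k in range(1, n_jumps + 1)]


def auxiliary_jump_times(n_peers, n_jumps, rng):
    """Random update instants T_k/N of the auxiliary system, where T_k are the
    jump instants of a Poisson process of intensity 1."""
    times = []
    t = 0.0
    for _ in range(n_jumps):
        t += rng.expovariate(1.0)  # i.i.d. exponential inter-jump times of mean 1
        times.append(t / n_peers)  # dividing by N gives waiting times of mean 1/N
    return times
```

Applying the same sequence of peer selections at either set of instants yields a rescaled and an auxiliary path that visit the same states, in the same order, at nearby times.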
Note that $\tilde M^N(t) = M^N(\lfloor Nt \rfloor)$ and $\hat M^N(t) = M^N(A(Nt))$, but that the
relationship between $\Lambda^N$, $\tilde\Lambda^N$ and $\hat\Lambda^N$ is more involved.

The process $\hat X^N$ is a pure-jump Markov process with rate bounded by $N$, at
which two peers are chosen uniformly at random without replacement, say $i$ and $j$
at time $t$, and:

• if $|\hat X_i^N(t-) - \hat X_j^N(t-)| > \Delta$ then $\hat X^N(t) = \hat X^N(t-)$,

• if $|\hat X_i^N(t-) - \hat X_j^N(t-)| \le \Delta$ then only the values of peers $i$ and $j$ change to

$$\hat X_i^N(t) = w \hat X_i^N(t-) + (1-w) \hat X_j^N(t-), \qquad \hat X_j^N(t) = w \hat X_j^N(t-) + (1-w) \hat X_i^N(t-).$$

Remark 4.1. Each of the $N(N-1)/2$ unordered pairs of peers is thus chosen at rate
$\frac{2}{N-1} = N \big/ \frac{N(N-1)}{2}$, and then both peers undergo a simultaneous jump in their
values if these are close enough. Each peer is thus affected at rate $2 = (N-1) \cdot \frac{2}{N-1}$.
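
The arithmetic of Remark 4.1 can be checked exactly with rational numbers: a total rate $N$ is shared equally among the $N(N-1)/2$ unordered pairs, and each peer belongs to $N-1$ of them. The helper names below are hypothetical.

```python
from fractions import Fraction


def pair_rate(n_peers):
    """Rate at which a given unordered pair is selected when the total rate is N."""
    n_pairs = Fraction(n_peers * (n_peers - 1), 2)
    return Fraction(n_peers) / n_pairs


def peer_rate(n_peers):
    """Rate at which a given peer is selected: it belongs to N - 1 pairs."""
    return (n_peers - 1) * pair_rate(n_peers)
```

Whatever the value of `n_peers`, `peer_rate` equals 2, which is the factor 2 appearing in front of the generators of the limit process.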

The generator $\mathcal{A}^N$ of $\hat X^N = (\hat X_i^N)_{1 \le i \le N}$ acts on $f \in L^\infty([0, 1]^N)$ (the Banach
space of essentially bounded measurable functions on $[0, 1]^N$) as

$$\mathcal{A}^N f((x_n)_{1 \le n \le N}) = \frac{2}{N-1} \sum_{1 \le i < j \le N} \big[ f((x_n^{i,j})_{1 \le n \le N}) - f((x_n)_{1 \le n \le N}) \big] \mathbf{1}_{\{|x_i - x_j| \le \Delta\}} \tag{4.5}$$

where $(x_n^{i,j})_{1 \le n \le N}$ is obtained from $(x_n)_{1 \le n \le N}$ by replacing $x_i$ and $x_j$ with $w x_i + (1 - w) x_j$
and $w x_j + (1 - w) x_i$ and leaving the other coordinates fixed. Its operator
norm is bounded by $2N$, and the law of the corresponding Markov process $\hat X^N$ is
well-defined in terms of the law of $\hat X^N(0) = X^N(0)$.

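For $1 \le k \le N$, this generator acts on $h_k \in L^\infty([0, 1]^N)$ which depend only on
the $k$-th coordinate, of the form $h_k((x_n)_{1 \le n \le N}) = h(x_k)$ for some $h \in L^\infty[0, 1]$, as

$$\frac{2}{N-1} \sum_{1 \le j \le N :\, j \ne k} \big[ h(w x_k + (1 - w) x_j) - h(x_k) \big] \mathbf{1}_{\{|x_k - x_j| \le \Delta\}} := \mathcal{A}\Big( \frac{1}{N-1} \sum_{1 \le j \le N :\, j \ne k} \delta_{x_j} \Big) h(x_k) \tag{4.6}$$

where the generators $\mathcal{A}(\mu)$ act on $h \in L^\infty[0, 1]$ as

$$\mathcal{A}(\mu) h(x) = 2 \big\langle [h(wx + (1 - w)y) - h(x)] \mathbf{1}_{\{|x - y| \le \Delta\}}, \mu(dy) \big\rangle, \qquad \mu \in \mathcal{P}[0, 1]. \tag{4.7}$$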
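
For a measure with finitely many atoms, the generator of (4.7) is a finite sum, and (4.6) is recovered by taking the empirical measure of the other peers. The sketch below makes this explicit; the function names and the representation of a measure by parallel lists `atoms` and `weights` are assumptions of the example.

```python
def generator_mu(atoms, weights, h, x, delta, w):
    """Evaluate the generator of (4.7) at x for the measure sum_m weights[m] * Dirac(atoms[m])."""
    return 2 * sum(
        p * (h(w * x + (1 - w) * y) - h(x))
        for y, p in zip(atoms, weights)
        if abs(x - y) <= delta
    )


def generator_empirical(xs, k, h, delta, w):
    """Right-hand side of (4.6): generator for the empirical measure of the
    peers other than k, evaluated at the opinion of peer k."""
    others = [xs[j] for j in range(len(xs)) if j != k]
    weights = [1 / len(others)] * len(others)
    return generator_mu(others, weights, h, xs[k], delta, w)
```

With $h(x) = x$ the value is twice the mean displacement of the opinion $x$ caused by an interaction with a peer drawn from the measure.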

Heuristically, if the $\hat X_i^N(0)$ converge in law to i.i.d. r.v. of law $m_0$, then the
$\hat X_i^N$ are expected to converge in law to i.i.d. processes of law $Q$, the law of a
time-inhomogeneous Markov process $X$ with initial law $m_0$ and generator $\mathcal{A}(m(t))$ at
time $t \in \mathbb{R}_+$, where $m(t) = \mathcal{L}(X_t) = Q_t$ is the instantaneous law of this same
process, the marginal of $Q$. Such a process is called a nonlinear Markov process, or
a McKean-Vlasov process. Considering the forward Kolmogorov equation for this
Markov process, $(m(t), t \in \mathbb{R}_+)$ should satisfy the following weak (or
distributional-sense) formulation of a nonlinear integro-differential equation.

April 19, 2011 5:27 boundedConfArxiv

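Definition 4.1 (Problem 1). We say that $m = (m(t), t \in \mathbb{R}_+)$ with $m(t) \in \mathcal{P}[0, 1]$
is solution to Problem 1 with initial value $m_0 \in \mathcal{P}[0, 1]$ if $m(0) = m_0$ and

$$\langle h, m(t) \rangle - \langle h, m(0) \rangle = \int_0^t \langle \mathcal{A}(m(s)) h, m(s) \rangle \, ds := \int_0^t 2 \big\langle [h(wx + (1 - w)y) - h(x)] \mathbf{1}_{\{|x - y| \le \Delta\}}, m(s)(dy) \, m(s)(dx) \big\rangle \, ds \tag{4.8}$$

for all test functions $h \in L^\infty[0, 1]$; this can be written more symmetrically as

$$\langle h, m(t) \rangle - \langle h, m(0) \rangle = \int_0^t \big\langle [h(wx + (1 - w)y) + h(wy + (1 - w)x) - h(x) - h(y)] \mathbf{1}_{\{|x - y| \le \Delta\}}, m(s)(dy) \, m(s)(dx) \big\rangle \, ds. \tag{4.9}$$

The distance in total variation norm of $\mu$ and $\mu'$ in $\mathcal{P}(S)$ is given by

$$|\mu - \mu'| = \sup_{\|\phi\|_\infty \le 1} \langle \phi, \mu - \mu' \rangle = 2 \sup\{\mu(A) - \mu'(A) : \text{measurable } A \subset S\}. \tag{4.10}$$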
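
The two integrands of Problem 1, the one-sided form (4.8) and the symmetric form (4.9), agree after integration against the product measure because exchanging $x$ and $y$ leaves it invariant. This can be checked numerically for a measure with finitely many atoms, as in the sketch below; the function names and the list representation of the measure are assumptions of the example.

```python
def rhs_one_sided(atoms, weights, h, delta, w):
    """Integrand of (4.8) at a fixed time, for the measure sum_m weights[m] * Dirac(atoms[m])."""
    total = 0.0
    for x, px in zip(atoms, weights):
        for y, py in zip(atoms, weights):
            if abs(x - y) <= delta:
                total += 2 * (h(w * x + (1 - w) * y) - h(x)) * px * py
    return total


def rhs_symmetric(atoms, weights, h, delta, w):
    """Integrand of (4.9) at a fixed time, for the same atomic measure."""
    total = 0.0
    for x, px in zip(atoms, weights):
        for y, py in zip(atoms, weights):
            if abs(x - y) <= delta:
                gain = h(w * x + (1 - w) * y) + h(w * y + (1 - w) * x) - h(x) - h(y)
                total += gain * px * py
    return total
```

Taking $h(x) = x$ in the symmetric form gives zero, which is the mean-field counterpart of the conservation of the mean in the finite model.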



Theorem 4.2. Consider the generators $\mathcal{A}(\mu)$ given by (4.7), and $m_0$ in $\mathcal{P}[0, 1]$.

(1) There is a unique solution $m = (m(t), t \in \mathbb{R}_+)$ to Problem 1 starting at $m_0$.
For the total variation norm on $\mathcal{P}[0, 1]$, $t \mapsto m(t)$ is continuous, and $m_0 \mapsto
(m(t), t \in \mathbb{R}_+)$ is continuous for uniform convergence on bounded time sets.

(2) There is a unique law $Q = \mathcal{L}(X)$ on $D(\mathbb{R}_+, [0, 1])$ for an inhomogeneous Markov
process $X = (X(t), t \in \mathbb{R}_+)$ with generator $\mathcal{A}(m(t))$ at time $t$ and initial law
$\mathcal{L}(X(0)) = m_0$. Its marginal $Q_t = \mathcal{L}(X_t)$ is given by $m(t)$.

Remark 4.2. Such nonlinear Markov processes and equations are well-known to
probabilists. The equations of Problem 1 have the same probabilistic structure as the weak forms
(2.1), (2.2), (2.4) (with $\mathcal{L} = 0$) of the spatially homogeneous version (without
$x$-dependence) of the Boltzmann equation (1.1) in Graham-Méléard, the weak form
(1.7) of the (cutoff) Kac equation (1.1)-(1.2) in Desvillettes et al., the nonlinear
Kolmogorov equation (2.7) in Graham, and the kinetic equation (9.4.4) in
Graham. The weak formulation involves explicitly the generator of the underlying
Markovian dynamics and allows one to understand it more directly. The functional
formulation (for probability density functions) of this integro-differential equation
involves an adjoint expression of this generator, and will be seen in Section 6.

4.2. Difficulties for classical mean-field limit proofs 

The system $\hat X^N$ exhibits simultaneous jumps in two coordinates, and is in binary
mean-field interaction in statistical mechanics terminology.

A system in which only one opinion would change at a time would be in simple
mean-field interaction; the generator $\mathcal{A}^N$ in (4.5) would be replaced by a simpler
expression, which could be written as a sum over $i$ of terms acting only on the
$i$-th coordinate in terms of the value $x_i$ and of $\frac{1}{N-1} \sum_{j \ne i} \delta_{x_j}$. Consequently, the
empirical measures would satisfy an equation in almost closed form, which could
be exploited in various ways to prove convergence to a limit satisfying the closed
nonlinear equation in which the empirical distribution is replaced by the law itself.

A binary mean-field interacting system is much more complex, since there is
much more feedback between peers. It is impossible to relate it in a simple way
to an independent system, in which the coordinates cannot jump simultaneously.
Because of that, the coupling methods introduced by Sznitman, see also Méléard
and Graham-Robert, cannot be adapted here. Moreover, these use contraction
techniques, and the metric used is too weak for the indicator functions.

Elaborate compactness-uniqueness methods are also used for proofs, see
Sznitman and also Méléard, Graham-Méléard Section 4, and Graham,
but require weak topologies for compactness criteria, and continuity properties in
order to pass to the limit; hence, the indicator functions prevent using them here.

Remark 4.3. The indicator functions require quite strong topologies. For instance,
if $0 < a < b = a + \Delta < 1$ and $m_0 = \frac{1}{2}(\delta_a + \delta_b)$, then there exists $M_+^N(0)$ with support
not intersecting $[a, b]$ and converging weakly to $m(0)$, and starting there $M_+^N(k)$ and
$m_+(t)$ have at least two clusters and support outside $[a, b]$. There exists also $M_-^N(0)$
with support inside $(a, b)$ and converging weakly to $m(0)$, and $M_-^N(k)$ and $m_-(t)$
have one cluster and support inside $(a, b)$ for any $k \in \mathbb{N}$, and will be a total consensus
after some random time.
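
The discontinuity described in Remark 4.3 is easy to reproduce on the finite model: shifting two groups of opinions by a small amount outward or inward around points at distance exactly the deviation threshold changes the long-time behavior completely. In the sketch below, the function names and the values of $a$, the threshold, the shift and $w$ are illustrative assumptions.

```python
import random


def run(initial, delta, w, n_steps, seed=0):
    """Run the finite model from the given initial opinions; returns the final list."""
    rng = random.Random(seed)
    x = list(initial)
    for _ in range(n_steps):
        i, j = rng.sample(range(len(x)), 2)
        if abs(x[i] - x[j]) <= delta:
            x[i], x[j] = w * x[i] + (1 - w) * x[j], w * x[j] + (1 - w) * x[i]
    return x


def spread(x):
    """Diameter of the set of opinions."""
    return max(x) - min(x)


a, delta, eps = 0.3, 0.4, 0.01
# Two groups just outside [a, a + delta]: they can never interact.
outside = [a - eps] * 5 + [a + delta + eps] * 5
# Two groups just inside (a, a + delta): every pair can interact.
inside = [a + eps] * 5 + [a + delta - eps] * 5
```

Running both initial conditions for a few thousand steps leaves the first one unchanged, with two clusters, while the second one contracts to a single value.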



4.3. Rigorous mean-field limit results for the auxiliary system 

Systems of this type were studied in two papers of Graham-Méléard, see also Ref. [13]. The first
paper studied a class of not necessarily Markovian multitype interacting systems, as
a model for communication networks. The second studied Monte-Carlo methods for
a class of Boltzmann models, and in particular expressed some notions and results
of the first in this framework. Their results yield the following.

For $k \ge 1$ and $T \ge 0$ and laws $P$ and $P'$ on $D(\mathbb{R}_+, [0, 1]^k)$, let $|P - P'|_T$ denote the
distance in variation norm (4.10) of the restrictions of $P$ and $P'$ on $D([0, T], [0, 1]^k)$.
When clear, the processes will be restricted to $[0, T]$ without further mention.



Theorem 4.3. Consider the auxiliary system (4.3) for $N \ge 2$. If the $\hat X_i^N(0) :=
X_i^N(0)$ are i.i.d. of law $m_0$, then there is propagation of chaos. More precisely, let
$m = (m(t), t \in \mathbb{R}_+)$ and $Q$ be as in Theorem 4.2 for $m(0) = m_0$, and $T \ge 0$.

(1) For $1 \le k \le N$,

$$\big| \mathcal{L}(\hat X_1^N, \dots, \hat X_k^N) - \mathcal{L}(\hat X_1^N) \otimes \cdots \otimes \mathcal{L}(\hat X_k^N) \big|_T \le 2k(k-1) \, \frac{2T + 4T^2}{N - 1}$$

and

$$\big| \mathcal{L}(\hat X_i^N) - Q \big|_T \le 6 \, \frac{\exp(2T) - 1}{N + 1}.$$

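(2) For any $\phi : D([0, T], [0, 1]) \to \mathbb{R}$ such that [...], an expectation involving
$\langle \phi, \hat\Lambda^N \rangle = \frac{1}{N} \sum_{i=1}^N \phi(\hat X_i^N)$ is bounded by an expression in $4 + 8T + 16T^2$ [...].
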

Moreover,

$$\hat\Lambda^N \xrightarrow[N \to \infty]{\text{in probab.}} Q, \qquad \hat M^N \xrightarrow[N \to \infty]{\text{in probab.}} m,$$

respectively for the weak topology on $\mathcal{P}(D(\mathbb{R}_+, [0, 1]))$ with the Skorohod topology
on $D(\mathbb{R}_+, [0, 1])$, and for the topology of uniform convergence on bounded time
intervals on $D(\mathbb{R}_+, \mathcal{P}[0, 1])$ with the weak topology on $\mathcal{P}[0, 1]$.
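
To get a feeling for the orders of magnitude in part (1) of Theorem 4.3, the two explicit bounds can be evaluated numerically. The function names below are hypothetical; recall that the total variation norm of (4.10) never exceeds 2, so a bound is informative only when it is below 2.

```python
import math


def chaos_bound(k, n_peers, horizon):
    """Bound 2k(k-1)(2T + 4T^2)/(N-1) on the distance between the joint law
    of k peers and the product of their marginal laws on [0, T]."""
    return 2 * k * (k - 1) * (2 * horizon + 4 * horizon ** 2) / (n_peers - 1)


def marginal_bound(n_peers, horizon):
    """Bound 6(exp(2T) - 1)/(N+1) on the distance between the law of one peer
    and the law Q of the nonlinear process on [0, T]."""
    return 6 * (math.exp(2 * horizon) - 1) / (n_peers + 1)
```

Both bounds decay like $1/N$ for a fixed horizon $T$, but the second one grows exponentially in $T$, so large horizons require very large $N$ for it to be informative.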

The assumption that the initial conditions are i.i.d. can be appropriately relaxed,
as in Theorem 1.4 in Graham-Méléard.

These very strong results are obtained for a relevant opinion model given by the 
auxiliary (rescaled) system, and are of independent interest. In the next section we 
will derive from them some weaker results for the original discrete-time model. 

Remark 4.4. The convergence result for $\hat\Lambda^N$ is equivalent to convergence in law
to $Q$ (Ethier-Kurtz, Corollary 3.3.3). The convergence result for $\hat M^N$ implies
convergence in law to $m$ for test functions which are continuous, bounded, and
measurable for the product $\sigma$-field (Theorem 3.10.2 of the cited reference). Separability issues
restrict these results; in fact, convergence of $\hat\Lambda^N$ holds for any convergence
induced by a denumerable set of bounded measurable functions.



4.4. From the auxiliary to the rescaled system 

For $k \ge 1$, let $a_k$ denote the Skorohod metric on $D(\mathbb{R}_+, [0, 1]^k)$ given by (3.5.21) in
Ethier-Kurtz for the atomic metric $(x, y) \mapsto \mathbf{1}_{\{x \ne y\}}$ on $[0, 1]^k$ (which induces the
topology of all subsets of $[0, 1]^k$, for which any function is continuous). Note that
$a_k$ is measurable with respect to the usual Borel $\sigma$-field on $[0, 1]^k \times [0, 1]^k$.

A time-change is an increasing homeomorphism of $\mathbb{R}_+$, i.e., a continuous
function from $\mathbb{R}_+$ to $\mathbb{R}_+$ which is null at the origin and strictly increasing to infinity.
Two paths are close for $a_k$ if there is a time-change close to the identity such that
the time-change of one path is equal to the other path.



Eq. (4.4) is the key to obtaining the following quite general result, showing that the rescaled system $X^N$ is very close to the auxiliary system $\tilde X^N$, up to a well-controlled (random) time-change.

Theorem 4.4. Consider the rescaled system (4.1) and the auxiliary system (4.3) for $N \ge 2$. Then $\lim_{N\to\infty} a_N(X^N, \tilde X^N) = 0$ in probability.

This result and Theorem 4.3 now yield the main mean-field convergence result.



Theorem 4.5. Consider the rescaled system (4.1) for $N \ge 2$. If the $\tilde X_i^N(0) := X_i^N(0)$ are i.i.d. of law $m_0$, then there is propagation of chaos. More precisely, let $m = (m(t), t \in \mathbb{R}_+)$ and $Q$ be as in Theorem 4.2 for $m(0) = m_0$.

(1) For $1 \le k \le N$,

$$\lim_{N\to\infty} \mathcal{L}(X_1^N, \dots, X_k^N) = Q^{\otimes k},$$

for the weak topology on $\mathcal{P}(D(\mathbb{R}_+, [0,1]^k))$ induced by test functions which are either uniformly continuous for the Skorohod metric $a_k$, bounded, and measurable for the usual product $\sigma$-field (for the usual Borel $\sigma$-field on $[0,1]^k$), or continuous for the usual Skorohod topology (for the usual metric on $[0,1]^k$) and bounded.

(2) For the usual topology of $[0,1]$,

$$\Lambda^N \xrightarrow[N\to\infty]{\text{in probab.}} Q, \qquad M^N \xrightarrow[N\to\infty]{\text{in probab.}} m,$$

respectively for the weak topology on $\mathcal{P}(D(\mathbb{R}_+, [0,1]))$ with the Skorohod topology on $D(\mathbb{R}_+, [0,1])$, and for the topology of uniform convergence on bounded time intervals on $D(\mathbb{R}_+, \mathcal{P}[0,1])$ with the weak topology on $\mathcal{P}[0,1]$.

The assumption that the initial conditions are i.i.d. may again be relaxed. For the second result, see again Remark 4.4.



5. Infinite N Model

We now study the mean-field limit $m = (m(t), t \in \mathbb{R}_+)$ obtained in Section 4 when $N$ goes to infinity. As for the finite $N$ model, we find that there is convergence to a partial consensus as $t$ goes to infinity. The limit may depend on the initial conditions, as might the random limit when $N$ is finite. We are able to say more, and notably find tractable sufficient conditions for the limit to be a total consensus.



5.1. Convexity and Moments 



Applying Proposition 3.2 to the equivalent definition of Problem 1 given by (4.9) yields the following:

Corollary 5.1. Let $m = (m(t), t \in \mathbb{R}_+)$ be a solution of Problem 1. If $h : [0,1] \to \mathbb{R}$ is convex, then $\langle h, m(t) \rangle$ is a non-increasing function of $t$. Moreover, for $n = 1, 2, \dots$ and $t \in \mathbb{R}_+$, let $\mu_n(t) = \int_0^1 x^n\, m(t)(dx)$ denote the $n$-th moment of $m(t)$, and $\sigma(t)$ its standard deviation (i.e., $\sigma(t)^2 = \mu_2(t) - \mu_1(t)^2$). Then:

(1) The mean $\mu_1(t)$ is stationary: $\mu_1(t) = \mu_1(0)$ for all $t$.

(2) The moments $\mu_n(t)$ are non-increasing in $t$: if $t_1 \le t_2$ then $\mu_n(t_1) \ge \mu_n(t_2)$.

(3) The standard deviation $\sigma(t)$ is also a non-increasing function of $t$.
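The conservation of the mean and the monotonicity of the standard deviation can be observed on a finite system of peers. The following is a minimal sketch, not the authors' code: it assumes the pairwise update $x \mapsto wx + (1-w)y$, $y \mapsto wy + (1-w)x$ applied to both peers whenever $|x - y| \le \Delta$, and the function names (`interact`, `simulate`) are illustrative.

```python
import random


def interact(x, y, delta, w):
    """One bounded-confidence interaction (assumed form): both peers move
    toward each other if their opinions differ by at most delta."""
    if abs(x - y) <= delta:
        return w * x + (1.0 - w) * y, w * y + (1.0 - w) * x
    return x, y


def mean_and_variance(opinions):
    """Empirical mean and variance of a list of opinions."""
    n = len(opinions)
    mean = sum(opinions) / n
    var = sum((v - mean) ** 2 for v in opinions) / n
    return mean, var


def simulate(opinions, delta, w, steps, seed=0):
    """Pick a random pair at each step, apply the interaction, and record
    (mean, variance) after every step."""
    rng = random.Random(seed)
    opinions = list(opinions)
    history = [mean_and_variance(opinions)]
    for _ in range(steps):
        i, j = rng.sample(range(len(opinions)), 2)
        opinions[i], opinions[j] = interact(opinions[i], opinions[j], delta, w)
        history.append(mean_and_variance(opinions))
    return opinions, history
```

In such a run the recorded mean stays constant up to rounding, while the recorded variance never increases, which is the finite-$N$ counterpart of items (1) and (3).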
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Furthermore, we have some bounds. 
Proposition 5.2. For all t>0, we have cr(0) > <r(t) > a^e'^^-^ . 

Note that Corollary 5.1 and Proposition 5.2 generalize results of Ref. [7], which established similar results for the case $w = 1/2$. However, the bound in Proposition 5.2 is different, as the equation considered in Ref. [7] misses a factor 2.



5.2. Convergence to Partial Consensus 

It is immediate that a partial consensus is a stationary point for Problem 1, i.e., if $(m(t), t \in \mathbb{R}_+)$ is a solution of Problem 1 with initial value a partial consensus $m_0$, then $m(t) = m_0$ for all $t$. Conversely, we show, in Theorem 5.5 below, that any trajectory $(m(t), t \in \mathbb{R}_+)$ converges to a partial consensus.

It is useful to consider the essential sup and inf of $m(t)$, defined as follows.

Definition 5.3. For $\nu \in \mathcal{P}[0,1]$, let $\operatorname{ess\,sup}(\nu) = \inf\{b \in [0,1] : \nu((b,1]) = 0\}$ and $\operatorname{ess\,inf}(\nu) = \sup\{a \in [0,1] : \nu([0,a)) = 0\}$.

Note that if $\operatorname{ess\,inf}(\nu) = a$ and $\operatorname{ess\,sup}(\nu) = b$, then the support of $\nu$ is included in $[a,b]$, i.e., for any measurable $B \subset [0,1]$, $\nu(B) = \nu(B \cap [a,b])$.

Proposition 5.4. Let $(m(t), t \in \mathbb{R}_+)$ be a solution of Problem 1. Then $\operatorname{ess\,sup}(m(t))$ [resp. $\operatorname{ess\,inf}(m(t))$] is a non-increasing [resp. non-decreasing] function of $t$.

See Definition 3.1 for partial and total consensus.



Theorem 5.5. Let $(m(t), t \in \mathbb{R}_+)$ be a solution of Problem 1. As $t$ goes to infinity, $m(t)$ converges, for the weak topology on $\mathcal{P}[0,1]$, to some $m(\infty)$ which is a partial consensus for every $\Delta' < \Delta$, i.e., of the form $m(\infty) = \sum_{m=1}^{c} \alpha_m \delta_{x_m}$ with $x_m \in [0,1]$, $|x_m - x_{m'}| \ge \Delta$ for $m \ne m'$, and $\alpha_m > 0$.

Note that the limit $m(\infty)$ may depend on the initial condition $m_0$, and may or may not be a total consensus (as shown in the next section). We are in particular interested in finding initial conditions that guarantee that $m(\infty)$ is a total consensus. The following is an immediate consequence of Proposition 5.4.



Corollary 5.6. If the diameter of $m_0$ is less than $\Delta$, i.e., if $\operatorname{ess\,sup}(m_0) - \operatorname{ess\,inf}(m_0) < \Delta$, then $m(\infty)$ is a total consensus.

Note that the converse is not true: if the diameter of $m_0$ is greater than or equal to $\Delta$, there may still be convergence to total consensus (see next section for an example).
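For a discrete initial condition, the diameter criterion of Corollary 5.6 is straightforward to check. The sketch below is illustrative only: it assumes the measure is given as a dictionary mapping atom positions to masses, and the names `ess_inf_sup` and `diameter_below_threshold` are hypothetical.

```python
def ess_inf_sup(atoms):
    """Essential inf and sup of a discrete measure given as {position: mass}.
    Atoms with zero mass do not belong to the support."""
    support = [x for x, mass in atoms.items() if mass > 0.0]
    return min(support), max(support)


def diameter_below_threshold(atoms, delta):
    """True if esssup - essinf < delta, the hypothesis of Corollary 5.6."""
    lo, hi = ess_inf_sup(atoms)
    return hi - lo < delta
```

When the function returns `True`, Corollary 5.6 guarantees total consensus; when it returns `False`, nothing can be concluded from this criterion alone.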



5.3. Convergence to Total Consensus 

We find sufficient criteria guaranteeing upper bounds on the number of components of $m(\infty)$; in particular, we find sufficient conditions for convergence to total consensus. Although the bounds are suboptimal, they are, to the best of our knowledge, the first of their kind. The bounds are based on Corollary 5.1.

First define, for $n \in \{1,2,3,\dots\}$ and $\mu_0 \in [0,1]$, the set $P_n(\mu_0)$ of partial consensus with $n$ components and mean $\mu_0$, i.e., $\nu \in P_n(\mu_0)$ iff there is some sequence $0 \le x_1 < \cdots < x_n \le 1$ with $x_i + \Delta < x_{i+1}$, and some sequence $\alpha_i$ for $i = 1, \dots, n$ with $0 < \alpha_i \le 1$ and $\sum_{i=1}^n \alpha_i = 1$, such that $\nu = \sum_{i=1}^n \alpha_i \delta_{x_i}$ and $\nu$ has mean $\mu_0$.

Second, for any convex, continuous $h : [0,1] \to \mathbb{R}_+$, let $Q_n(\mu_0, h)$ be the set of strict lower bounds of the image of $P_n(\mu_0)$ by the mapping $\nu \mapsto \langle h, \nu \rangle$, i.e., $q \in Q_n(\mu_0, h)$ iff for any partial consensus $\nu$ with $n$ components and mean $\mu_0$, it holds that $\langle h, \nu \rangle > q$. If $P_n(\mu_0)$ is empty, let $Q_n(\mu_0, h) = \mathbb{R}_+$.

Note that $Q_n(\mu_0, h)$ is necessarily an interval with lower bound 0. The following proposition states that $Q_n$ is non-decreasing with $n$.

Proposition 5.7. For any $n \in \{1,2,3,\dots\}$, $\mu_0 \in [0,1]$ and convex continuous $h : [0,1] \to \mathbb{R}_+$, it holds that $Q_n(\mu_0, h) \subset Q_{n+1}(\mu_0, h)$.

Combining Proposition 5.7 with Corollary 5.1 we obtain:



Theorem 5.8. Let $(m(t), t \in \mathbb{R}_+)$ be the solution of Problem 1 with initial condition $m_0$, and $c$ be the number of components of the limiting partial consensus $m(\infty)$. Assume that, for some $n \in \{2, 3, \dots\}$, some convex continuous $h : [0,1] \to \mathbb{R}_+$, and some $q > 0$, we have $q \in Q_n(\mu_0, h)$, where $\mu_0$ is the mean of $m_0$.
Under these assumptions, if $\langle h, m_0 \rangle < q$ then $c \le n - 1$.

Here is an example of use of the theorem, for $n = 2$ and $h(x) = |x - \mu_0|$.

Corollary 5.9. Let $(m(t), t \in \mathbb{R}_+)$ be the solution of Problem 1 starting at $m_0$. Assume that $\Delta > \frac{1}{2}$ and $1 - \Delta < \mu_0 < \Delta$, where $\mu_0$ is the mean of $m_0$. If

$$\int_0^1 |x - \mu_0|\, m_0(dx) < \frac{2}{\Delta} \min\{\mu_0(\Delta - \mu_0),\ (1 - \mu_0)(\Delta - 1 + \mu_0)\}$$

then $m(t)$ converges to total consensus.

If we apply this to $m_0$ equal to the uniform distribution, we find the sufficient condition $\Delta > \frac{2}{3}$ for convergence to total consensus. In Corollary 5.15 we find a better result, obtained by exploiting symmetry properties.



Definition 5.10. We say that $\nu \in \mathcal{P}[0,1]$ is symmetric if the image measure of $\nu$ by $x \mapsto 1 - x$ is $\nu$ itself.

Note that if $\nu$ has a density $f$, this simply means that $f(x) = f(1 - x)$. Necessarily, if $\nu$ is symmetric, the mean of $\nu$ is $\frac{1}{2}$. If a partial consensus $\nu = \sum_{i=1}^n \alpha_i \delta_{x_i}$ (with $x_i < x_{i+1}$) is symmetric, then $x_{n+1-i} = 1 - x_i$ and $\alpha_{n+1-i} = \alpha_i$; in particular, if $n$ is odd, $x_{\frac{n+1}{2}} = \frac{1}{2}$.

Proposition 5.11. Let $(m(t), t \in \mathbb{R}_+)$ be a solution of Problem 1 with initial value $m_0$. If $m_0$ is symmetric, then $m(t)$ is symmetric for all $t \ge 0$.

We can extend the previous method to the symmetric case as follows. Define $SP_n$ as the set of symmetric partial consensus with $n$ components, and let $q \in SQ_n(h)$ if and only if every symmetric partial consensus $\nu$ with $n$ components satisfies $\langle h, \nu \rangle > q$. If $SP_n$ is empty, then $SQ_n(h) = \mathbb{R}_+$. We have similarly:

Proposition 5.12. For any $n \in \{1,2,3,\dots\}$ and convex continuous $h : [0,1] \to \mathbb{R}_+$, it holds that $SQ_n(h) \subset SQ_{n+1}(h)$.

Theorem 5.13. Let $(m(t), t \in \mathbb{R}_+)$ be the solution of Problem 1 for a symmetric initial condition $m_0$, and $c$ be the number of components of the limiting partial consensus $m(\infty)$. Assume that, for some $n \in \{2,3,\dots\}$, some convex continuous $h : [0,1] \to \mathbb{R}_+$, and some $q > 0$, we have $q \in SQ_n(h)$.

Under these assumptions, if $\langle h, m_0 \rangle < q$ then $c \le n - 1$.



We apply Theorem 5.13 with $h(x) = |x - \frac{1}{2}|$. It is easy to see that for $\nu \in SP_2$ we have $\langle h, \nu \rangle > \frac{\Delta}{2}$, which shows the following:

Corollary 5.14. Let $(m(t), t \in \mathbb{R}_+)$ be the solution of Problem 1 for a symmetric initial condition $m_0$. If $\Delta > 2 \int_0^1 |x - \frac{1}{2}|\, m_0(dx)$ then $m(t)$ converges either to a total consensus or to a partial consensus with 3 or more components.

In particular, if $m_0$ is the uniform distribution on $[0,1]$, then $\int_0^1 |x - \frac{1}{2}|\, m_0(dx) = \frac{1}{4}$ and the condition in the previous corollary is $\Delta > \frac{1}{2}$, thus we have shown:

Corollary 5.15. Let $(m(t), t \in \mathbb{R}_+)$ be the solution of Problem 1 with initial condition the uniform distribution on $[0,1]$. If $\Delta > \frac{1}{2}$ then $m(t)$ converges to a total consensus.
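The two sufficient conditions for the uniform initial condition can be compared with exact rational arithmetic. This is only a sketch under stated assumptions: the bound of Corollary 5.9 is read here as $\int_0^1 |x - \mu_0|\, m_0(dx) < \frac{2}{\Delta}\min\{\mu_0(\Delta - \mu_0), (1 - \mu_0)(\Delta - 1 + \mu_0)\}$ with $\Delta > \frac{1}{2}$ and $1 - \Delta < \mu_0 < \Delta$, and the function names are hypothetical.

```python
from fractions import Fraction


def corollary_5_9_holds(first_abs_moment, mu0, delta):
    """Sufficient condition of Corollary 5.9 (as read above), where
    first_abs_moment is the integral of |x - mu0| against m_0."""
    if not (delta > Fraction(1, 2) and 1 - delta < mu0 < delta):
        return False
    bound = (2 / delta) * min(mu0 * (delta - mu0),
                              (1 - mu0) * (delta - 1 + mu0))
    return first_abs_moment < bound


def corollary_5_14_holds(first_abs_moment_about_half, delta):
    """Condition of Corollary 5.14 for a symmetric initial condition."""
    return delta > 2 * first_abs_moment_about_half


# For the uniform distribution on [0, 1], the integral of |x - 1/2| is 1/4.
UNIFORM_ABS_MOMENT = Fraction(1, 4)
```

With these definitions, the first criterion is satisfied for the uniform distribution exactly when $\Delta > \frac{2}{3}$, while the symmetric criterion already applies for $\Delta > \frac{1}{2}$.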

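Corollary 5.16. Let $(m(t), t \in \mathbb{R}_+)$ be a solution of Problem 1 with initial condition $m_0 = \frac{1-\alpha}{2}\,\delta_0 + \alpha\,\delta_{1/2} + \frac{1-\alpha}{2}\,\delta_1$. There is convergence to total consensus for $\Delta > 1 - \alpha$ if $\alpha < \frac{1}{2}$, or $\Delta > \frac{1}{2}$ if $\alpha \ge \frac{1}{2}$.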



6. Numerical Approach 

In the mean-field limit, the dynamical behavior of the system of peers can be described by the integro-differential equation given in weak form in Definition 4.1 (Problem 1). This equation has no closed-form solution to our knowledge, and we have developed a numerical method for it.

We describe the algorithm, and analyze its precision and complexity. An important fact is that this algorithm requires considerably less running time than the probabilistic methods used in Neau when $N$ is large (which is not surprising in dimension 1). The program consists of 600 lines of C++ code, and the parsing and plotting of the results were done using Matlab.



6.1. Functional formulation of Problem 1 

The numerical method is based on the functional formulation for probability density functions (PDFs) obtained by duality from the weak formulation in Definition 4.1 of Problem 1. The following result is fundamental in this respect.

Theorem 6.1. Let $(m(t), t \ge 0)$ be a solution of Problem 1. If the initial condition $m_0$ is absolutely continuous with respect to Lebesgue measure, then so is $m(t)$ for every $t \ge 0$, and moreover the densities $f(\cdot, t)$ of $m(t)$ satisfy the integro-differential equation

$$\frac{\partial f(x,t)}{\partial t} = \frac{2}{w} \int_{x - \Delta w}^{x + \Delta w} f\!\left(\frac{x - (1-w)y}{w}, t\right) f(y,t)\,dy \;-\; 2 f(x,t) \int_{x - \Delta}^{x + \Delta} f(y,t)\,dy. \qquad (6.1)$$

Conversely, if $f : \mathbb{R} \times \mathbb{R}_+ \to \mathbb{R}$ is a solution of Eq. (6.1) such that $f(\cdot, t)$ is a PDF with support $[0,1]$ for every $t \ge 0$, then the probability measures $m(t)(dx) = f(x,t)\,dx$ solve Problem 1.

This result and Theorem 4.2 yield an existence and uniqueness result for Eq. (6.1). This equation can be derived in statistical mechanics fashion by balance considerations. For the gain term, a particle in state $\frac{x - (1-w)y}{w}$ interacts at rate 2 (see Remark 4.1) with a particle in state $y$ to end up in state $x$, and the joint density for this pre-interaction configuration at time $t$ is $\frac{1}{w} f\!\left(\frac{x - (1-w)y}{w}, t\right) f(y,t)$ (particles are "independent before interacting"). The loss term is derived similarly.
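The balance between gain and loss can be checked numerically. The sketch below is illustrative and makes several assumptions: the gain term is written with the substitution $y = x + wu$, which turns it into $2\int_{-\Delta}^{\Delta} f(x + wu, t)\, f(x - (1-w)u, t)\,du$; the density is piecewise constant on equal cells of $[0,1]$ and zero outside; and integrals are approximated by a midpoint rule. The names `make_density` and `rhs` are hypothetical.

```python
def make_density(values):
    """Piecewise-constant density on [0, 1) with len(values) equal cells;
    it is zero outside [0, 1)."""
    n = len(values)

    def f(x):
        if x < 0.0 or x >= 1.0:
            return 0.0
        return values[min(int(x * n), n - 1)]

    return f


def rhs(f, x, delta, w, n_quad=400):
    """Midpoint-rule estimate of gain minus loss at the point x, using the
    variable u in [-delta, delta] for both integrals."""
    h = 2.0 * delta / n_quad
    gain = 0.0
    loss = 0.0
    for k in range(n_quad):
        u = -delta + (k + 0.5) * h
        gain += f(x + w * u) * f(x - (1.0 - w) * u)  # pre-interaction pair
        loss += f(x + u)                             # partners within delta
    return 2.0 * h * gain - 2.0 * f(x) * h * loss
```

For a uniform density the estimate vanishes in the middle of $[0,1]$, is negative near the boundaries and positive slightly inside, and its integral over $[0,1]$ is close to zero, as expected from conservation of mass.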



Remark 6.1. As noted in Remark 4.2, this is a Boltzmann-like equation. This is made more obvious for $w \ne 1/2$ by the change of variables leading to post-interaction states $x$ and $y$, which yields the equivalent formulation

$$\frac{\partial f(x,t)}{\partial t} = \frac{2}{2w-1} \int_{x - \Delta(2w-1)}^{x + \Delta(2w-1)} f\!\left(\frac{wx - (1-w)y}{2w-1}, t\right) f\!\left(\frac{wy - (1-w)x}{2w-1}, t\right) dy \;-\; 2 f(x,t) \int_{x-\Delta}^{x+\Delta} f(y,t)\,dy \qquad (6.2)$$

more reminiscent of Boltzmann or Kac equations such as (1.1) in Graham-Méléard or (1.1)-(1.2) in Desvillettes et al. In these, the fact that the gain term involves pre-collisional velocities is obscured by the physical symmetries between pre-collisional and post-collisional velocities, which are absent here.

In the rest of this section we assume that the hypothesis of the above theorem holds. We show next that if the PDF $f(\cdot, 0)$ is bounded then so is $f(\cdot, t)$ and we can control its growth over time. Let

$$M(t) \stackrel{\text{def}}{=} |f(\cdot,t)|_\infty = \sup_{x \in [0,1]} |f(x,t)|, \qquad t \ge 0.$$

Proposition 6.2. Then M(t) < e( » + T^) t (M(0) + 4) - 4 for all t > 0. 



It follows that $f(\cdot, t)$ is bounded for all $t$, and iteratively, using (6.1), $f$ is $C^\infty$ in its second variable. Having controlled the growth of $f(x,t)$, it is easy to control the growth of its derivatives.






Proposition 6.3. Then |^/(-,*)L < + t^,) ( M (t) + 4 ) f° r all t > 0. 

Iteratively, we obtain the following corollary.

Corollary 6.4. If $|f(\cdot, 0)|_\infty < \infty$ then $\left|\frac{\partial^n}{\partial t^n} f(\cdot, t)\right|_\infty < \infty$ for all $n$ and $t \ge 0$.



6.2. Numerical Solution of Eq.(6.1) 



Although Equation (6.1) for the PDF does not appear to have any tractable closed-form solution, it lends itself well to numerical solution. We developed an algorithm that gives an approximate solution of Equation (6.1) over some finite time horizon $T$, given some initial condition $f(x,0)$, assumed to be piecewise constant. The algorithm is described in Appendix B. In the rest of this section, we present numerical results obtained with the algorithm.

We study different scenarios for the initial distribution: uniform, extremists versus undecided, and beta. We find bifurcations as a function of $\Delta$. Moreover, we compare the experimental results with the bounds obtained in Section 5 and the probabilistic Monte Carlo simulations presented in Ref. [12]. The main results are summarized in the following table:



Scenario           Parameters                                        Consensus
Uniform            $\Delta < 0.27$, $w \in (0,1)$                    Partial
Uniform            $\Delta > 0.27$, $w \in (0,1)$                    Total
Extremists/Und.    $(\Delta, \alpha)$ below black curve (fig. 4)     Partial
Extremists/Und.    $(\Delta, \alpha)$ above black curve (fig. 4)     Total
Extremists/Und.    $(\Delta, \alpha)$ above red curve (fig. 4)       Partial (theor. bound)
Beta               $\Delta > 0.25$, $w \in \{0.5, 0.75\}$            Total
Beta               $\Delta < 0.25$, $w \in \{0.5, 0.75\}$            Partial
Beta               $\Delta > 0.2$, $w = 0.9$                         Total
Beta               $\Delta < 0.2$, $w = 0.9$                         Partial

Table 1. Summary of the numerical experiments



6.2.1. General Evolution of the System 

In order to illustrate the behavior of the system as time passes, we show how the system evolves from a uniform distribution to one (or more) components, depending on the deviation threshold $\Delta$. We run those sets of experiments for 3 different values of $w$, specifically 0.5, 0.75, and 0.9, and plot the probability density function at times $t = 0$, $t = 20$ and $t = 100$. The simulations have been done with the parameters $I = 200$, $\Delta t = 0.1$, $T = 100$. Although this set of parameters might theoretically yield a large error, in practice the error is much smaller.

Fig. 1. w = 0.5. Evolution of m(t) at times t = 0, 20, 100.

Fig. 2. w = 0.75. Evolution of m(t) at times t = 0, 20, 100.

From the images, we see that w does not seem to impact the number of compo- 
nents of m(oo), but the weights do depend on w. 

6.2.2. Extremists versus Undecided 

We now present some common scenarios: imagine a company merger and the opinion of the employees about the new company, or a rough categorization of voters in an election. We can characterize these opinions as extremists (either 0 or 1) or undecided (0.5). The proportion of opinions is $\alpha$ for the undecided and $\frac{1-\alpha}{2}$ for each of the extremist classes. To simulate this, we have approximated the initial conditions (Diracs) by constant splines of value $I\alpha$ and $I\frac{1-\alpha}{2}$ respectively, centered at their corresponding points, such that the initial condition has mass 1. We plot the result (1 component, i.e. total consensus, or 2 components) for each pair $(\alpha, \Delta)$ in $[0,1] \times [\frac{1}{2}, 1]$ in Figure 4. We know from Corollary 5.16 that total consensus must occur for $\Delta > 1 - \alpha$, and we see that the region of convergence to total consensus is a bit larger, and slightly depends on $w$.

Fig. 3. w = 0.9. Evolution of m(t) at times t = 0, 20, 100.

Note that values of $\Delta$ smaller than $\frac{1}{2}$ would result in no motion at all. We do this for the previous set of values for $w$ and find that in every case, the fraction of undecided people necessary to achieve consensus is much smaller than what one would expect.

We also plot the center of mass of the first half of the distribution to show that it is not a smooth function of $\alpha$ and that close to the critical value $\Delta_c(\alpha)$ there is a jump. We did this for the previous 3 values of $w$ but show only one result for brevity.



6.2.3. Initial Uniform Conditions, Impact of A 

We present here the evolution of the number of components with respect to $\Delta$, using as initial condition a uniform distribution. Note that we have grouped the situations with 7 or more components into the category "7 or more", which is represented by 7 in the graph. For a component to be considered as such, we require that it has at least 1% of the total mass; otherwise we treat it as zero. Again, the results are plotted for the 3 different values of $w$.
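The counting rule can be made concrete as follows. This is a hedged sketch rather than the procedure actually used for the figures: it assumes the final density is piecewise constant on equal cells of $[0,1]$, that components are maximal runs of adjacent non-negligible cells, and that `eps`, `min_mass` and `cap` are illustrative parameters.

```python
def count_components(values, min_mass=0.01, eps=1e-12, cap=7):
    """Count maximal runs of cells with density above eps whose total mass
    is at least min_mass, and report at most `cap` components."""
    n = len(values)
    count = 0
    run_mass = 0.0
    for v in list(values) + [0.0]:  # trailing sentinel closes the last run
        if v > eps:
            run_mass += v / n       # mass of one cell of width 1/n
        else:
            if run_mass >= min_mass:
                count += 1
            run_mass = 0.0
    return min(count, cap)
```

A run carrying less than 1% of the mass is ignored, so small residual bumps do not inflate the count.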

We observe that the results are almost independent of $w$, as there is almost no difference between the 3 curves (see Figure 6 for the combined plot of all 3 functions). It is also worth remarking that, if we compare our results for $w = 0.5$ with the deterministic model to those of Ref. [12] with the probabilistic model, the intervals of $\Delta$ in which the latter has a high probability of convergence to $n$ components correspond to the same intervals in which we have convergence to $n$ components. This suggests that the approximation $N \to \infty$ is good enough to preserve properties such as the final state.

Fig. 4. Bifurcation diagram for extremists and undecided (panel (c): w = 0.9). The curly line separates the region of convergence to total consensus (above) from convergence to a partial consensus with two components. The straight line is the sufficient condition in Corollary 5.16.

Fig. 5. w = 0.9. Center of mass of the first half, showing that the transition is abrupt.



Fig. 6. $\Delta$ vs number of components of $m(\infty)$. Uniform initial conditions. Blue: w = 0.5 (below black), Red: w = 0.75, Black: w = 0.9.



6.2.4. Beta Distribution as Initial Condition 

Here we study the evolution of the number of components with respect to $\Delta$, using as initial condition a Beta(1,6) distribution. The functions that have 5 or more components have been put into the category represented by 5. Again, we count a component only if it has 1% of the total mass or more. We present the results for the 3 different values of $w$.

We observe again the same phenomenon as in the uniform case, namely that the influence of $w$ is negligible. If we compare these results with the ones in Subsection 6.2.3, we can conclude that the final result depends on the initial condition, even for the same parameters $w$ and $\Delta$. Moreover, we can see that for a fixed $(w, \Delta)$, if we start with a Beta distribution, the number of components will be less than or equal to the number obtained when starting with a uniform one. This is explained by the fact that with the Beta distribution the mass is more concentrated than with the uniform distribution (in our case: to the left) and therefore it should be harder (i.e., $\Delta$ should be smaller) to split into the same number of components.

Appendix A. Probabilistic, Topological and Measurability issues 

In the particular case of probability measures on $[0,1]$, a sequence $\nu_n$ converges weakly to $\nu$ if and only if $\langle f, \nu_n \rangle$ converges to $\langle f, \nu \rangle$ for any continuous (and hence bounded) $f : [0,1] \to \mathbb{R}$. Equivalently, the cumulative distribution function (CDF) of $\nu_n$ converges to the CDF of $\nu$ at all continuity points of the limit.

Fig. 7. $\Delta$ vs number of components of $m(\infty)$. Initial condition Beta(1,6). (Panel (c): w = 0.9.)

More generally, Ethier-Kurtz [16] will be the main reference.

Let $S$ be a metric space with a $\sigma$-field (not necessarily the Borel $\sigma$-field), $\mathcal{P}(S)$ the space of probability measures on $S$ (for this $\sigma$-field), and $D(\mathbb{R}_+, S)$ the Skorohod space of right-continuous paths with left-hand limits (for this metric).

When $S$ is given the Borel $\sigma$-field, the weak topology of $\mathcal{P}(S)$ corresponds to the convergences

$$P_n \xrightarrow[n\to\infty]{\text{weak}} P \iff \langle f, P_n \rangle \xrightarrow[n\to\infty]{} \langle f, P \rangle, \quad \forall f \in C_b(S, \mathbb{R})$$

where $C_b(S, \mathbb{R})$ denotes the space of continuous bounded functions. Convergence in law of random elements, defined possibly on distinct probability spaces but having common sample space $S$, is defined as weak convergence of their laws:

$$Y_n \xrightarrow[n\to\infty]{\text{law}} Y \iff \mathcal{L}(Y_n) \xrightarrow[n\to\infty]{\text{weak}} \mathcal{L}(Y) \iff E(f(Y_n)) \xrightarrow[n\to\infty]{} E(f(Y)), \quad \forall f \in C_b(S, \mathbb{R}).$$

If $S$ is separable and is given the Borel $\sigma$-field, then the weak topology is metrizable and $\mathcal{P}(S)$ is separable (Ref. [16], Theorems 3.3.1 and 3.1.7).

If $S$ is not separable, then the Borel $\sigma$-field is usually too strong to sustain reasonable probability measures, and $S$ must be given a weaker, separable, $\sigma$-field. This causes problems between topological and measure-theoretic issues, and classic results such as the Portmanteau theorem (Ref. [16], Theorem 3.3.1) may fail to hold.

The natural $\sigma$-field on $D(\mathbb{R}_+, S)$ is the product (or projection) $\sigma$-field of the $\sigma$-field on $S$, and will always be used in the sequel. The classical topology given to $D(\mathbb{R}_+, S)$ is the Skorohod topology, which can be metrized by (3.5.2) or (3.5.21) in Ref. [16]. If $S$ is separable then $D(\mathbb{R}_+, S)$ is separable (Ref. [16], Theorem 3.5.6) and then, if $S$ is given the Borel $\sigma$-field, the Borel $\sigma$-field of the Skorohod topology and the product $\sigma$-field coincide. For weak convergence with a continuous limit process, uniform convergence on bounded time intervals may be used with adequate measurability assumptions on the test functions (Ref. [16], Theorem 3.10.2).



Appendix B. Algorithm 



In this section we present an algorithm for the numerical solution of Equation (6.1). The algorithm takes as input the initial condition $f_r(x, 0)$, assumed to be a piecewise constant function, and the time horizon $T$ up to which we want to calculate an approximate solution. It outputs an approximation of the solution $f_r(x, T)$. It works as follows.

First, in steps of $\Delta t$ we approximate $f_r(x, t + \Delta t)$ by using a forward Euler method:

$$f_e(x, t + \Delta t) \stackrel{\text{def}}{=} f_r(x, t) + \Delta t\, \partial_t f_r(x, t)$$

Here we exploit the fact that $f_r(x,t)$ is a piecewise constant function, so that we can calculate analytically the derivative, which is a piecewise linear function. The expression for the derivative is explained later. Hence, $f_e(x, t + \Delta t)$ is also piecewise linear (in $x$), as it is the sum of a piecewise linear and a piecewise constant function. Then, we approximate $f_e(x, t + \Delta t)$ with another piecewise constant function (which we will call $f_r(x, t + \Delta t)$ for simplicity) of $I$ intervals, so that we can reuse the same scheme and compute explicitly the expression for the derivative. The constants are chosen so that the integral of $f_r(\cdot, t)$ is equal to 1 (i.e., it is a probability density). We perform this loop until we calculate $f_r(x, T)$ in steps of $\Delta t$. The algorithm is given next.



Algorithm 1 Numerical Solution of Equation (6.1)

Input $f_r(x, 0)$, $T$, $\Delta t$, $I$
Output $f_r(x, T)$
for $t \leftarrow 0$ to $T$ step $\Delta t$ do
    $f_e(x, t + \Delta t) \leftarrow f_r(x, t) + \Delta t\, \partial_t f_r(x, t)$
    $f_r(x, t + \Delta t) \leftarrow$ PiecewiseConstantApproximation($f_e(x, t + \Delta t)$, $I$)
end for

The method PiecewiseConstantApproximation returns a piecewise constant approximation such that the total integral equals 1.

B.1. Piecewise constant approximation

We choose as piecewise constant approximation for $f_e(x,t)$ on any interval $X = [x_s, x_e]$ the value $M = \frac{1}{x_e - x_s} \int_{x_s}^{x_e} f_e(x,t)\,dx$, i.e., the center of mass.

Proposition B.1. For any $t \ge 0$ it holds that $\int_0^1 f_r(x,t)\,dx = 1$.
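The loop of Algorithm 1 can be sketched in a few lines. This is a simplified illustration, not the authors' C++ program: instead of the exact piecewise linear derivative, it evaluates the right-hand side of Equation (6.1) at cell midpoints with a midpoint quadrature (using the variable $u \in [-\Delta, \Delta]$ for both integrals), and it restores unit mass by rescaling after each Euler step. The names `rhs_at_midpoints` and `solve` and the parameter `n_quad` are assumptions made for this example.

```python
def rhs_at_midpoints(values, delta, w, n_quad=40):
    """Approximate time derivative of a piecewise-constant density at the
    midpoint of each of its len(values) equal cells on [0, 1]."""
    n = len(values)

    def f(x):
        if x < 0.0 or x >= 1.0:
            return 0.0
        return values[min(int(x * n), n - 1)]

    h = 2.0 * delta / n_quad
    out = []
    for i in range(n):
        x = (i + 0.5) / n
        gain = 0.0
        loss = 0.0
        for k in range(n_quad):
            u = -delta + (k + 0.5) * h
            gain += f(x + w * u) * f(x - (1.0 - w) * u)
            loss += f(x + u)
        out.append(2.0 * h * gain - 2.0 * values[i] * h * loss)
    return out


def solve(values, delta, w, dt, horizon):
    """Forward Euler steps followed by a rescaling to unit integral."""
    values = list(values)
    n = len(values)
    for _ in range(int(round(horizon / dt))):
        deriv = rhs_at_midpoints(values, delta, w)
        euler = [v + dt * d for v, d in zip(values, deriv)]
        mass = sum(euler) / n            # integral of the step function
        values = [v / mass for v in euler]
    return values
```

Starting from a uniform density, the output keeps unit mass and its variance decreases over time, in line with Corollary 5.1; the accuracy of this sketch is lower than that of a scheme using the exact derivative.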
B.2. Analytical expression of $\partial_t f_r(x,t)$

Now we will give an exact expression for the derivative, using the fact that $f_r(x,t)$ is piecewise constant. This helps to understand how the calculation of the derivative is implemented and its asymptotic cost. We can write, for any $t$, that

$$f_r(x,t) = \sum_{i=1}^{I} a_i \left[ H(x - x_i) - H(x - x_{i+1}) \right]$$

where $H(x)$ is the Heaviside step function. Let us set, for any $x_i$ and $x_j$, that

$$I_1^{i,j}(x) \stackrel{\text{def}}{=} \int_{x-\Delta}^{x+\Delta} H(x - x_i) H(z - x_j)\,dz = \int_{-\Delta}^{\Delta} H(x - x_i) H(x + u - x_j)\,du,$$

$$I_2^{i,j}(x) \stackrel{\text{def}}{=} \int_{-\Delta}^{\Delta} H(x + wu - x_i) H(x - (1-w)u - x_j)\,du.$$

The expressions of $I_1^{i,j}(x)$ and $I_2^{i,j}(x)$ depend on the relative order between $x_i$ and $x_j$ and $m = \max\{(1-w)x_i + w x_j,\ x_i - w\Delta\}$, and are summarized in Tables 2 and 3. Finally, we can calculate $\partial_t f_r(x,t)$ as:

$$\partial_t f_r(x,t) = -2 \sum_{i,j} a_i a_j \left( I_1^{i,j}(x) + I_1^{i+1,j+1}(x) - I_1^{i,j+1}(x) - I_1^{i+1,j}(x) \right) + 2 \sum_{i,j} a_i a_j \left( I_2^{i,j}(x) + I_2^{i+1,j+1}(x) - I_2^{i,j+1}(x) - I_2^{i+1,j}(x) \right).$$



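Case $x_i < x_j - \Delta < x_j + \Delta$:
    $I_1^{i,j}(x) = 0$ if $x < x_j - \Delta$
    $I_1^{i,j}(x) = x - (x_j - \Delta)$ if $x_j - \Delta \le x < x_j + \Delta$
    $I_1^{i,j}(x) = 2\Delta$ if $x_j + \Delta \le x$

Case $x_j - \Delta < x_i < x_j + \Delta$:
    $I_1^{i,j}(x) = 0$ if $x < x_i$
    $I_1^{i,j}(x) = x - (x_j - \Delta)$ if $x_i \le x < x_j + \Delta$
    $I_1^{i,j}(x) = 2\Delta$ if $x_j + \Delta \le x$

Case $x_j - \Delta < x_j + \Delta < x_i$:
    $I_1^{i,j}(x) = 0$ if $x < x_i$
    $I_1^{i,j}(x) = 2\Delta$ if $x_i \le x$

Table 2. Expression for $I_1^{i,j}(x)$

Case $m < x_i + w\Delta < x_j - (1-w)\Delta < x_j + (1-w)\Delta$:
    $I_2^{i,j}(x) = 0$ if $x < x_j - (1-w)\Delta$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} + \Delta$ if $x_j - (1-w)\Delta \le x < x_j + (1-w)\Delta$
    $I_2^{i,j}(x) = 2\Delta$ if $x_j + (1-w)\Delta \le x$

Case $x_i + w\Delta < m < x_j - (1-w)\Delta < x_j + (1-w)\Delta$:
    $I_2^{i,j}(x) = 0$ if $x < x_j - (1-w)\Delta$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} + \Delta$ if $x_j - (1-w)\Delta \le x < x_j + (1-w)\Delta$
    $I_2^{i,j}(x) = 2\Delta$ if $x_j + (1-w)\Delta \le x$

Case $m < x_j - (1-w)\Delta < x_i + w\Delta < x_j + (1-w)\Delta$:
    $I_2^{i,j}(x) = 0$ if $x < x_j - (1-w)\Delta$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} - \frac{x_i - x}{w}$ if $x_j - (1-w)\Delta \le x < x_i + w\Delta$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} + \Delta$ if $x_i + w\Delta \le x < x_j + (1-w)\Delta$
    $I_2^{i,j}(x) = 2\Delta$ if $x_j + (1-w)\Delta \le x$

Case $x_i + w\Delta < x_j - (1-w)\Delta < m < x_j + (1-w)\Delta$:
    $I_2^{i,j}(x) = 0$ if $x < m$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} + \Delta$ if $m \le x < x_j + (1-w)\Delta$
    $I_2^{i,j}(x) = 2\Delta$ if $x_j + (1-w)\Delta \le x$

Case $m < x_j - (1-w)\Delta < x_j + (1-w)\Delta < x_i + w\Delta$:
    $I_2^{i,j}(x) = 0$ if $x < x_j - (1-w)\Delta$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} - \frac{x_i - x}{w}$ if $x_j - (1-w)\Delta \le x < x_j + (1-w)\Delta$
    $I_2^{i,j}(x) = \Delta - \frac{x_i - x}{w}$ if $x_j + (1-w)\Delta \le x < x_i + w\Delta$
    $I_2^{i,j}(x) = 2\Delta$ if $x_i + w\Delta \le x$

Case $x_i + w\Delta < x_j - (1-w)\Delta < x_j + (1-w)\Delta < m$:
    $I_2^{i,j}(x) = 0$ if $x < m$
    $I_2^{i,j}(x) = 2\Delta$ if $m \le x$

Case $x_j - (1-w)\Delta < m < x_i + w\Delta < x_j + (1-w)\Delta$:
    $I_2^{i,j}(x) = 0$ if $x < m$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} - \frac{x_i - x}{w}$ if $m \le x < x_i + w\Delta$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} + \Delta$ if $x_i + w\Delta \le x < x_j + (1-w)\Delta$
    $I_2^{i,j}(x) = 2\Delta$ if $x_j + (1-w)\Delta \le x$

Case $x_j - (1-w)\Delta < x_i + w\Delta < m < x_j + (1-w)\Delta$:
    $I_2^{i,j}(x) = 0$ if $x < m$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} + \Delta$ if $m \le x < x_j + (1-w)\Delta$
    $I_2^{i,j}(x) = 2\Delta$ if $x_j + (1-w)\Delta \le x$

Case $x_j - (1-w)\Delta < m < x_j + (1-w)\Delta < x_i + w\Delta$:
    $I_2^{i,j}(x) = 0$ if $x < m$
    $I_2^{i,j}(x) = \frac{x - x_j}{1-w} - \frac{x_i - x}{w}$ if $m \le x < x_j + (1-w)\Delta$
    $I_2^{i,j}(x) = \Delta - \frac{x_i - x}{w}$ if $x_j + (1-w)\Delta \le x < x_i + w\Delta$
    $I_2^{i,j}(x) = 2\Delta$ if $x_i + w\Delta \le x$

Case $x_j - (1-w)\Delta < x_i + w\Delta < x_j + (1-w)\Delta < m$:
    $I_2^{i,j}(x) = 0$ if $x < m$
    $I_2^{i,j}(x) = 2\Delta$ if $m \le x$

Case $x_j - (1-w)\Delta < x_j + (1-w)\Delta < m < x_i + w\Delta$:
    $I_2^{i,j}(x) = 0$ if $x < m$
    $I_2^{i,j}(x) = \Delta - \frac{x_i - x}{w}$ if $m \le x < x_i + w\Delta$
    $I_2^{i,j}(x) = 2\Delta$ if $x_i + w\Delta \le x$

Case $x_j - (1-w)\Delta < x_j + (1-w)\Delta < x_i + w\Delta < m$:
    $I_2^{i,j}(x) = 0$ if $x < m$
    $I_2^{i,j}(x) = 2\Delta$ if $m \le x$

Table 3. Expression for $I_2^{i,j}(x)$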
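The case tables can be cross-checked against the integral definitions. The sketch below is an illustration under stated assumptions: it reads the definitions as $I_1^{i,j}(x) = \int_{-\Delta}^{\Delta} H(x - x_i) H(x + u - x_j)\,du$ and $I_2^{i,j}(x) = \int_{-\Delta}^{\Delta} H(x + wu - x_i) H(x - (1-w)u - x_j)\,du$ with $0 < w < 1$, writes each as a single clamped expression (the length of the admissible interval in $u$), and compares it with a midpoint Riemann sum. All function names are hypothetical.

```python
def heaviside(z):
    """Heaviside step function, with the convention H(0) = 1."""
    return 1.0 if z >= 0.0 else 0.0


def i1_closed(x, xi, xj, delta):
    """Clamped form of I_1: zero before xi, then the length of
    {u in [-delta, delta] : x + u >= xj}."""
    if x < xi:
        return 0.0
    return min(max(x - (xj - delta), 0.0), 2.0 * delta)


def i2_closed(x, xi, xj, delta, w):
    """Clamped form of I_2: length of the set of u in [-delta, delta] with
    u >= (xi - x) / w and u <= (x - xj) / (1 - w)."""
    upper = min(delta, (x - xj) / (1.0 - w))
    lower = max(-delta, (xi - x) / w)
    return max(0.0, upper - lower)


def riemann(integrand, delta, n=20000):
    """Midpoint Riemann sum of integrand over [-delta, delta]."""
    h = 2.0 * delta / n
    return h * sum(integrand(-delta + (k + 0.5) * h) for k in range(n))
```

Each branch of the tables corresponds to which of the clamps in these expressions is active for the given ordering of the breakpoints.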



B.3. Error Bound 

To calculate the error made by our approximation, define 



g s (x,t) = f{x,t) ift>s>0, g s { Xl t) a " f r (x,t) if < t < s, 
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and let v\(dx), v*(dx) and ^ s {dx) be the measures associated to f e (x,t), f r (x,t) 
and g s (x,t) respectively. Note that v*(dx) = fx\{dx). Thus, we want to bound 



£tot = \Ho(dx) - vJ(dx)\ T = 



T/ (At) 
k=l 



T/ (At) 

< \vJk-l)At( dx ) ~ VkAt( dx )\T- 

fe=l 

We can bound the error done in each iteration of the loop by decomposing it as 

\^{ dx )-^t-X)At{ dx )W 



< \v^\dx) - v^\dx)\ T + \v k *\dx) - ^-i)At( dx )\ 



dor 



-c.s i °e-u- 



Proposition B.2. Let M(t) = \f r (x, i)|oo- If |/ r (a;,0)|oo = M(0) = M < oo then 
the following uniform bound holds: 

IdtfOr^AOU <c(M,T), V0<fc< £-1. 

This results in the following bound for $\varepsilon_{c.s}$:

$$\varepsilon_{c.s} \le c(M, T)\,\Delta t. \qquad (B.1)$$

The constant $c(M, T)$ needs to be evaluated empirically; in practice, numerical experiments have shown that it is of the order $10^{-6}$ to $10^{-7}$. Then, with the known fact that



Proposition B.3. 



0((At) 2 ), 



(B.2) 



using Equations (B.l I and (B.2) yields 

,fcAt/j \ ,.kAt 



\fi k k H(dx) - ^\ )At (dx)\ T < e c . s + e eu = cAt + {(At) 2 ) . 
Finally, we bound \^ T k _ 1 ) At {dx)~^l At (dx)\ T in terms of \^\ / At {dx)-^\~ )At (dx)\ T . 



Proposition B.4. For all 1 < k < ^ and for all t > kAt we have that 



MftAt(dx) - M(fc_l)Ai( da; ) 



< e 8(t-fcAt) 



^(dx) - fi^ 1)At (dx) 



Combining the previous propositions, we obtain: 

Theorem B.5. For any fixed $T$, the error of the method is $C + O(\Delta t)$, where $C$ is a constant that depends on $c$ and $T$.

B.4. Complexity

We will now give the complexity analysis of the algorithm. The computation of the derivative takes $O(I^2)$, where $I$ is the number of intervals, since there is a double sum over $I$ intervals. Also, this produces $O(I^2)$ splines because every $I_k^{i,j}(x)$, $k = 1, 2$, is composed of at most 4 splines. Since the splines are not produced in increasing order of $x$, we need to sort them, which takes $O(I^2 \log I)$ time. Finally, we only need one pass to make the piecewise constant spline approximation since now everything is sorted. This takes $O(I^2)$ time. Since this whole loop is executed $\frac{T}{\Delta t}$ times, the running time has complexity $O\!\left(\frac{T}{\Delta t} I^2 \log I\right)$.

Appendix C. Proofs 



C.1. Proof of Proposition 3.2

By definition, since $h$ is convex,

$$h(wx + (1-w)y) \le w h(x) + (1-w) h(y),$$

$$h(wy + (1-w)x) \le w h(y) + (1-w) h(x),$$

with strict inequalities if $h$ is strictly convex except when $x = y$ or $w \in \{0, 1\}$, and summing these two inequalities yields the result.

C.2. Proof of Proposition 3.5

The first statement is obvious, since a partial consensus is an absorbing state. 
We prove the second statement. It follows from the second statement in Corol- 



lary 3.4 that, if the two peers, say (i,j) chosen at any time slot k! are such that 



|Xf (k') - Xf(k')\ < A and X? (k') ^ Xf (k'), then fi%(k' + l) < ^(k'). Assume 
now that the hypothesis of the second statement holds. It follows that all peers chosen for interaction at times $k' \ge k$ have reputation values that either differ by more than $\Delta$, or are equal; thus, at any time slot $k' \ge k$, the interaction has no effect. It follows that $M^N(k) = M^N(k')$ for $k' \ge k$.

Further, assume that $M^N(k)$ is not a partial consensus. Thus, there exists a pair of peers $(i,j)$ such that $|X_i^N(k) - X_j^N(k)| \le \Delta$ and $X_i^N(k) \ne X_j^N(k)$. The pair is never chosen in an interaction at times $k' \ge k$, for otherwise this would contradict the fact that $M^N(k')$ is stationary. But this occurs with probability 0.

C.3. Proof of Proposition 3.7

Let $i$ and $j$ be the peers selected for interaction at time $k + 1$. If at time $k$ they
were in different clusters, then nothing happens and the proposition holds. Assume
now that at time $k$ they were in the same cluster, say $C_1$. Let $i'$ be a peer not in
$C_1$ at time $k$; at time $k + 1$, after interaction, the opinions of $i$ and $j$ have moved
closer, hence farther from $i'$, to which they are still not connected. Hence, the only
differences between connections at times $k$ and $k + 1$ concern pairs of peers that
are both in the same cluster, and the result easily follows.
C.4. Proof of Theorem

Let $\sigma^2(k)$ be the variance of $M^N(k)$ (we drop the superscript $N$ in the notation local
to this proof). By Corollary 3.4, $\sigma^2(k)$ is non-increasing and non-negative, and thus
converges to some $\sigma^2(\infty)$.

For $k \ge K^N$ the set of clusters remains the same, $C^N(k) = \{C_1, \ldots, C_{L^N}\}$, and we
can thus define the diameter of cluster $\ell_1 \in \{1, \ldots, L^N\}$ by

$$\delta_{\ell_1}(k) = \max_{i,j \in C_{\ell_1}} |X_i^N(k) - X_j^N(k)| \qquad \text{(C.3)}$$

and set

$$\delta_{\ell_1} = \limsup_{k \to \infty} \delta_{\ell_1}(k).$$

Assume that $\delta_{\ell_1} > 0$ for some $\ell_1$. Since the sequence $\sigma^2(k)$ converges, there exists
some random time $K_1 \ge K^N$ such that for all $k, k' \ge K_1$ we have

W)-^(*)|<*^(%) a , (C.4) 
and there is an infinite subsequence of time slots $K_2(n) \ge K_1$ for $n \in \mathbb{N}$ such that

^(X 2 (n))>^>0. 
For $k \ge K_1$, let $(I(k), J(k))$ be a pair of peers that achieves the maximum in
(C.3) and let $E_k$ be the event "the pair of peers selected for interaction at time
$k$ is $(I(k), J(k))$". The probability of $E_k$, conditional on all the past up to time slot
$k$, is a positive constant that depends only on $N$. Thus the probability that $E_k$ occurs
infinitely often is 1, i.e., with probability 1 we can extract an infinite subsequence
of time slots $K_3(n)$ of $K_2(n)$ such that $E_{K_3(n)}$ is true. The following lemma then
implies that the variance decreases by at least a fixed positive amount at every time
slot $K_3(n)$, which contradicts (C.4), and this proves by contradiction that $\delta_{\ell_1} = 0$.



Lemma C.1. Let $(i,j)$ be the pair of peers chosen for interaction at time slot $k$.
Assume that $|X_i^N(k) - X_j^N(k)| \le \Delta$. Then the reduction in variance is
$\sigma^2(k+1) - \sigma^2(k) = -\frac{2w(1-w)}{N}\left(X_i^N(k) - X_j^N(k)\right)^2$.

Proof. By direct computation. □
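The direct computation can be cross-checked numerically. The sketch below assumes that $M^N(k)$ is the empirical measure $\frac{1}{N}\sum_i \delta_{X_i^N(k)}$ and that an interaction replaces $(x_i, x_j)$ by $(wx_i + (1-w)x_j,\ wx_j + (1-w)x_i)$; under these assumptions the variance changes by $-\frac{2w(1-w)}{N}(x_i - x_j)^2$. The function names and parameter values are illustrative only.

```python
import random


def variance(xs):
    """Variance of the empirical measure of the values in xs."""
    n = len(xs)
    mean = sum(xs) / n
    return sum((x - mean) ** 2 for x in xs) / n


def interact(xs, i, j, w):
    """Return a copy of xs after peers i and j have interacted."""
    ys = list(xs)
    ys[i] = w * xs[i] + (1 - w) * xs[j]
    ys[j] = w * xs[j] + (1 - w) * xs[i]
    return ys


def worst_identity_error(n=50, w=0.3, delta=0.4, trials=1000, seed=1):
    """Largest deviation between the observed and the predicted variance change."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        xs = [rng.random() for _ in range(n)]
        i, j = rng.sample(range(n), 2)
        if abs(xs[i] - xs[j]) > delta:
            continue  # too far apart: no interaction, nothing to check
        predicted = -2 * w * (1 - w) * (xs[i] - xs[j]) ** 2 / n
        observed = variance(interact(xs, i, j, w)) - variance(xs)
        worst = max(worst, abs(observed - predicted))
    return worst


print(worst_identity_error())
```

The printed value is a rounding-level quantity, since the identity is exact in real arithmetic.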

Let $\mu_{\ell_1}(k)$ be the empirical mean of cluster $\ell_1$ at time $k \ge K^N$. Since interactions
that modify the state of the process at times $k \ge K^N$ are all intra-cluster, it follows
that $\mu_{\ell_1}(k) = \mu_{\ell_1}(K^N) := \mu_{\ell_1}(\infty)$ for all $k \ge K^N$. For $i \in C_{\ell_1}$ it holds that
$|X_i^N(k) - \mu_{\ell_1}(k)| \le \delta_{\ell_1}(k) \to 0$, and hence $X_i^N(k) \to \mu_{\ell_1}(\infty)$ as $k \to \infty$. Thus, for
any continuous $f : [0, 1] \to \mathbb{R}$:

$$\lim_{k \to \infty} \langle f, M^N(k)\rangle = \frac{1}{N}\sum_{\ell=1}^{L^N} N_\ell\, f(\mu_\ell(\infty))$$

where $N_\ell$ is the cardinality of $C_\ell$. This shows that, with probability 1, $M^N(k)$
converges to $M^N(\infty) = \frac{1}{N}\sum_{\ell=1}^{L^N} N_\ell\, \delta_{\mu_\ell(\infty)}$.

It remains to show that $M^N(\infty)$ is a partial consensus. This follows from the fact
that if $i$ and $j$ are not in the same cluster at time slot $k$, then $|X_i^N(k) - X_j^N(k)| > \Delta$,
which implies that $|\mu_{\ell_1}(k) - \mu_{\ell_2}(k)| > \Delta$ if $\ell_1 \ne \ell_2$ and, since $\mu_{\ell_1}(k)$ is stationary
for $k$ large enough, that $|\mu_{\ell_1}(\infty) - \mu_{\ell_2}(\infty)| > \Delta$.
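The convergence to clusters can be observed in a small simulation. This is a hedged sketch rather than the authors' code: it assumes that two distinct peers are drawn uniformly at random at each time slot and that they interact when their opinions differ by at most $\Delta$. The names `simulate` and `split_into_clusters` and all parameter values are illustrative.

```python
import random


def simulate(n=100, w=0.5, delta=0.2, steps=100_000, seed=0):
    """Run the pairwise interaction for a fixed number of time slots."""
    rng = random.Random(seed)
    initial = [rng.random() for _ in range(n)]
    xs = list(initial)
    for _ in range(steps):
        i, j = rng.sample(range(n), 2)       # two distinct peers (assumed uniform)
        if abs(xs[i] - xs[j]) <= delta:      # interaction only if close enough
            xi, xj = xs[i], xs[j]
            xs[i] = w * xi + (1 - w) * xj
            xs[j] = w * xj + (1 - w) * xi
    return initial, xs


def split_into_clusters(opinions, delta):
    """Group sorted opinions; a gap larger than delta starts a new group."""
    ordered = sorted(opinions)
    groups = [[ordered[0]]]
    for x in ordered[1:]:
        if x - groups[-1][-1] > delta:
            groups.append([x])
        else:
            groups[-1].append(x)
    return groups


initial, final = simulate()
for group in split_into_clusters(final, 0.2):
    size = len(group)
    barycenter = sum(group) / size
    diameter = group[-1] - group[0]
    print(size, round(barycenter, 4), diameter)
```

Each printed line gives a cluster size, its barycenter, and its diameter. The global mean is conserved by every interaction, which the simulation preserves up to rounding.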



C.5. Proof of Theorem 4.2 



We write (4.7) and (4.8) in the notation of Section 2.2 in Graham, in which the
corresponding equations are (2.5) and (2.7), and

$$A(\mu)h(x) = 2\left\langle [h(wx + (1-w)y) - h(x)]\,1_{\{|x-y| \le \Delta\}},\ \mu(dy)\right\rangle = \int (h(z) - h(x))\,J(\mu, x, dz)$$

for $J(\mu, x, dz)$ the image measure of $1_{\{|x-y| \le \Delta\}}\,2\mu(dy)$ by $y \mapsto wx + (1-w)y$. Since
$|J(\mu, x, \cdot)| \le 2$ and $|J(\mu, x, \cdot) - J(\nu, x, \cdot)| \le 2|\mu - \nu|$, the assumptions of Proposition 2.3
in the same reference are satisfied, yielding the results. The family (4.7) is uniformly
bounded by 4 in operator norm, and thus there is a well-defined inhomogeneous
Markov process with generator $A(m(t))$ at time $t$ and arbitrary initial law.



C.6. Proof of Theorem 

First, the proof of (1). The generator $A^N$ corresponds to the "binary mean-field
model" (2.6) in Graham-Méléard with $N$ instead of $n$ and $d = 0$, and (using
$\sum_{1 \le i \ne j \le N} = 2\sum_{1 \le i < j \le N}$) "jump kernel"

$$\hat{\mu}(x, y, dh, dk) = 1_{\{|x-y| \le \Delta\}}\,2\,\delta_{\{(w-1)x + (1-w)y,\ (w-1)y + (1-w)x\}}(dh, dk)$$

which is uniformly bounded in total mass by $\Lambda = 2$. We conclude with Theorem 3.1
in Ref. [22] and the triangle inequality $\left|\frac{1}{N}\sum_{i=1}^N \mathcal{L}(X_i^N) - Q\right|_T \le \left|\mathcal{L}(X_1^N) - Q\right|_T$
(the $X_i^N$ are exchangeable).

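Now, the proof of (2). As in the proof of Theorem 3.1 in Ref. [22],

$$\left\langle \phi,\ \Lambda^N - \frac{1}{N}\sum_{i=1}^N \mathcal{L}(X_i^N)\right\rangle^2 = \frac{1}{N^2}\left(\sum_{i=1}^N \left(\phi(X_i^N) - E[\phi(X_i^N)]\right)\right)^2$$

in which

$$\left(\sum_{i=1}^N \left(\phi(X_i^N) - E[\phi(X_i^N)]\right)\right)^2 = \sum_{i=1}^N \left(\phi(X_i^N) - E[\phi(X_i^N)]\right)^2 + \sum_{1 \le i \ne j \le N} \left(\phi(X_i^N) - E[\phi(X_i^N)]\right)\left(\phi(X_j^N) - E[\phi(X_j^N)]\right)$$

where the first sum on the r.h.s. has $N$ terms, the second $N(N-1)$, and

$$E\left[\left(\phi(X_i^N) - E[\phi(X_i^N)]\right)\left(\phi(X_j^N) - E[\phi(X_j^N)]\right)\right] = E[\phi(X_i^N)\phi(X_j^N)] - E[\phi(X_i^N)]\,E[\phi(X_j^N)],$$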

and we conclude to the first formula in using |7p for k = 2. 

Classically, the weak topology in the Polish space $\mathcal{P}(D(\mathbb{R}_+, [0, 1]))$ has a
convergence-determining sequence $(g_m)_{m \ge 1}$ of continuous functions bounded by 1
(such a sequence is constructed in the proof of Proposition 3.4.4 in Ethier-Kurtz),
and can thus be metrized by $d(P, Q) = \left(\sum_{m \ge 1} 2^{-m} \langle g_m, P - Q\rangle^2\right)^{1/2}$. Moreover, the
first formula in and the second in Q imply that E(d(K N , Q) 2 ) goes to 0, which 
proves convergence in probability for A N . 

The result for $\Lambda^N$ implies the result for its marginal process $M^N$ as a quite
general topological fact, since the limit marginal process $m$ is continuous and the
spaces are Polish (Theorem 4.6 in Graham-Méléard, Section 4.3 in Méléard);
proofs first use the Skorohod topology, and then Theorem 3.10.2 in Ref. HT)1 



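C.7. Proof of Theorem 4.4

Let $\lambda_N : \mathbb{R}_+ \to \mathbb{R}_+$ be the (random) time-change given by the linear interpolation
of $\lambda_N\left(\frac{k}{N}\right) = \frac{T_k}{N}$, i.e., by

$$t \in \left[\frac{k}{N}, \frac{k+1}{N}\right] \mapsto \lambda_N(t) = (k + 1 - tN)\frac{T_k}{N} + (tN - k)\frac{T_{k+1}}{N}, \qquad k \in \mathbb{N}.$$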



Then (4.4) implies that



X" (t) = X" (X N (t)) , teR+, 
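so that their atomic distance is null. The triangle inequality yields, for $k \in \mathbb{N}$,

$$|\lambda_N(t) - t| \le \left|\frac{T_k}{N} - \frac{k}{N}\right| + \frac{1}{N}(T_{k+1} - T_k) + \frac{1}{N}, \qquad t \in \left[\frac{k}{N}, \frac{k+1}{N}\right],$$

and hence, for any $T \ge 0$,

$$\sup_{0 \le t \le T} |\lambda_N(t) - t| \le \frac{1}{N}\sup_{0 \le k \le \lfloor NT \rfloor} |T_k - k| + \frac{1}{N}\sup_{0 \le k \le \lfloor NT \rfloor} (T_{k+1} - T_k) + \frac{1}{N}.$$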



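For $\varepsilon > 0$, Kolmogorov's maximal inequality implies that

$$P\left(\frac{1}{N}\sup_{0 \le k \le \lfloor NT \rfloor} |T_k - k| > \varepsilon\right) \le \frac{1}{\varepsilon^2 N^2}\sum_{i=1}^{\lfloor NT \rfloor} \mathrm{var}(T_i - T_{i-1}) = \frac{\lfloor NT \rfloor}{\varepsilon^2 N^2}$$

and classically

$$P\left(\frac{1}{N}\sup_{0 \le k \le \lfloor NT \rfloor} (T_{k+1} - T_k) > \varepsilon\right) = 1 - \left(1 - e^{-N\varepsilon}\right)^{\lfloor NT \rfloor + 1} \le (\lfloor NT \rfloor + 1)\,e^{-N\varepsilon}.$$

Hence, for all $\delta > 0$,

$$\lim_{N \to \infty} P\left(\sup_{0 \le t \le T} |\lambda_N(t) - t| > \delta\right) = 0,$$

from which the result follows.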
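The decay of the time-change error can be illustrated numerically. This is a sketch under an assumption consistent with the variance and exponential-tail computations in this proof: the $T_k$ are partial sums of i.i.d. exponential variables with mean 1. With $\lambda_N(k/N) = T_k/N$ and linear interpolation in between, $t \mapsto \lambda_N(t) - t$ is linear on each interval, so its supremum over $[0, \lfloor NT \rfloor / N]$ is attained at a grid point. The function name is illustrative.

```python
import random


def time_change_sup(n, horizon=1.0, seed=0):
    """Max over grid points k/n <= horizon of |lambda_n(k/n) - k/n| = |T_k - k| / n."""
    rng = random.Random(seed)
    t_k = 0.0      # T_0 = 0
    worst = 0.0
    for k in range(1, int(n * horizon) + 1):
        t_k += rng.expovariate(1.0)   # T_k - T_{k-1} is exponential with mean 1
        worst = max(worst, abs(t_k - k) / n)
    return worst


for n in (100, 1_000, 20_000):
    print(n, time_change_sup(n))
```

The printed values typically shrink at a rate of order $N^{-1/2}$, in line with the bound $\lfloor NT \rfloor / (\varepsilon^2 N^2)$.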



C.8. Proof of Theorem 4.5

Result (1) follows from the previous convergence in probability result and Theorem 4.3,
using either the uniform continuity of the test functions (for the atomic
metric) or Corollary 3.3.3 in Ethier-Kurtz (for the usual metric). Result (2), which
involves Polish spaces, follows as for Theorem 4.3.



C.9. Proof of Proposition \ 5. J| 

For $0 \le b$ and $t \in [0, b]$ define $u(t) := \sigma^2(b - t) - \sigma^2(b)$. Note that $\mu_1(t)$ is a constant,
thus $u(t) = \mu_2(b - t) - \mu_2(b)$. By the alternative definition of Problem 1,

$$u(t) = -\int_{b-t}^{b}\int_{[0,1]^2} \left[(wx + (1-w)y)^2 + (wy + (1-w)x)^2 - x^2 - y^2\right] 1_{\{|x-y| \le \Delta\}}\, m(s)(dx)\,m(s)(dy)\,ds.$$

By Proposition 3.2 the bracket is nonpositive, and the indicator function is upper
bounded by 1, thus

$$u(t) \le -\int_{b-t}^{b}\int_{[0,1]^2} \left[(wx + (1-w)y)^2 + (wy + (1-w)x)^2 - x^2 - y^2\right] m(s)(dx)\,m(s)(dy)\,ds = K\int_{b-t}^{b} \sigma^2(s)\,ds = K\left(\sigma^2(b)\,t + \int_0^t u(s)\,ds\right)$$

with $K = 4w(1-w)$. By Gronwall's lemma:

$$u(t) \le K\sigma^2(b)\,t + K^2\sigma^2(b)\,e^{Kt}\int_0^t s\,e^{-Ks}\,ds = \sigma^2(b)\left(e^{Kt} - 1\right).$$

Let $t = b$ and the proposition follows.
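The closed form of the Gronwall bound can be verified numerically. The sketch below factors out $\sigma^2(b)$ and compares $Kt + K^2 e^{Kt}\int_0^t s\,e^{-Ks}\,ds$, evaluated by a midpoint rule, with $e^{Kt} - 1$; the value $w = 0.3$ and the function name are illustrative choices.

```python
import math


def gronwall_rhs(K, t, steps=20_000):
    """K*t + K^2 * exp(K*t) * integral_0^t s*exp(-K*s) ds, via the midpoint rule."""
    ds = t / steps
    integral = 0.0
    for i in range(steps):
        s = (i + 0.5) * ds              # midpoint of the i-th subinterval
        integral += s * math.exp(-K * s) * ds
    return K * t + K * K * math.exp(K * t) * integral


w = 0.3                      # illustrative weight
K = 4 * w * (1 - w)          # constant K = 4w(1-w) from the proof
for t in (0.5, 1.0, 2.0):
    print(t, gronwall_rhs(K, t), math.exp(K * t) - 1.0)
```

Taking $t = b$ in $u(b) = \sigma^2(0) - \sigma^2(b) \le \sigma^2(b)(e^{Kb} - 1)$ then gives $\sigma^2(b) \ge \sigma^2(0)\,e^{-Kb}$.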






C.10. Proof of Proposition 5.4

Fix some $t_0 \ge 0$; we will show that $\mathrm{ess\,inf}(m(t)) \ge \mathrm{ess\,inf}(m(t_0))$ for every $t \ge t_0$.
Clearly, it is sufficient to consider the case $\mathrm{ess\,inf}(m(t_0)) > 0$. Take some arbitrary
$a < \mathrm{ess\,inf}(m(t_0))$. Let $h(x) = 1_{\{x \le a\}}$ and $\varphi(t) = \langle h, m(t)\rangle$. We have $\varphi(t_0) = 0$ and,
by definition of Problem 1:

$$\varphi(t) \le 2\int_{t_0}^{t} \left\langle |h(wx + (1-w)y) - h(x)|,\ m(s)(dx)\,m(s)(dy)\right\rangle ds.$$

Note that $|h(wx + (1-w)y) - h(x)| \le 1$ and that $h(wx + (1-w)y) - h(x) \ne 0$
requires either $x \le a, y > a$ or $x > a, y \le a$. Thus

$$\varphi(t) \le 2\int_{t_0}^{t} 2\varphi(s)(1 - \varphi(s))\,ds \le 4\int_{t_0}^{t} \varphi(s)\,ds.$$

By Gronwall's lemma, this shows that $\varphi(t) = 0$ for $t \ge t_0$. Thus $m(t)[0, a] = 0$ for all
$t \ge t_0$, and this is true for any $a < \mathrm{ess\,inf}(m(t_0))$, thus $\mathrm{ess\,inf}(m(t)) \ge \mathrm{ess\,inf}(m(t_0))$.
This shows that $\mathrm{ess\,inf}(m(t))$ is nondecreasing. The proof for the ess sup is analogous.

C.11. Proof of Theorem 5.5

1. We show that $m(t)$ converges to some probability $m(\infty)$. This follows from Proposition 3.2
applied, for example, to the family of functions $h_u : x \mapsto e^{-ux}$ indexed by
$u \in [0, \infty)$. For any fixed $u$, $\langle h_u, m(t)\rangle$ is a nonincreasing function of $t$ (since $h_u$ is
convex) and is nonnegative, thus converges as $t \to \infty$. The limit is a probability (apply
convergence to the constant function equal to 1).

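2. We would like to conclude that $m(\infty)$ is a stationary point, i.e.
$\langle A(m(\infty))h, m(\infty)\rangle = 0$ for any $h \in L^\infty[0, 1]$; however, there is a technical difficulty
since the definition of $A$ involves the discontinuous function $1_{\{|x-y| \le \Delta\}}$. We
circumvent the difficulty as follows. For $\varepsilon > 0$ and smaller than $\Delta$, let $\ell_\varepsilon(x)$ be the
continuous function of $x \in \mathbb{R}_+$ equal to 1 for $x \le \Delta - \varepsilon$, 0 for $x \ge \Delta$, and the linear
interpolation in-between. We have $1_{\{x \le \Delta - \varepsilon\}} \le \ell_\varepsilon(x) \le 1_{\{x \le \Delta\}}$ for all $x \ge 0$. Let
$h(x) = x^2$. By the alternative definition of Problem 1, for $t$ and $u \ge 0$:

$$\langle h, m(t + u)\rangle - \langle h, m(t)\rangle \le -2w(1-w)\int_{t}^{t+u} \left\langle (x - y)^2\,\ell_\varepsilon(|x - y|),\ m(s)(dx)\,m(s)(dy)\right\rangle ds.$$

Fix $u > 0$ and let $t \to \infty$. By weak convergence of the product measure $m(t) \otimes m(t)$
it follows that

$$0 \le -2w(1-w)\,u\,\left\langle (x - y)^2\,\ell_\varepsilon(|x - y|),\ m(\infty)(dx)\,m(\infty)(dy)\right\rangle$$

and thus $\langle (x - y)^2\,\ell_\varepsilon(|x - y|),\ m(\infty)(dx)\,m(\infty)(dy)\rangle = 0$, from where we conclude that

$$\left\langle (x - y)^2\,1_{\{|x-y| \le \Delta - \varepsilon\}},\ m(\infty)(dx)\,m(\infty)(dy)\right\rangle = 0 \qquad \text{(C.5)}$$

for all $\varepsilon \in (0, \Delta)$.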

3. Fix some $\varepsilon > 0$ and integrate the previous equation with respect to $y$; it follows
that $\langle r(x), m(\infty)(dx)\rangle = 0$ with $r(x) = \langle (y - x)^2\,1_{\{|y-x| \le \Delta - \varepsilon\}},\ m(\infty)(dy)\rangle$, thus
there is a set $\Omega_1 \subset [0, 1]$ with $m(\infty)(\Omega_1) = 1$ and $r(x) = 0$ for every $x \in \Omega_1$. Let $x_1$
be an element of $\Omega_1$ (which is not empty since $m(\infty)(\Omega_1) = 1$). Then $r(x_1) = 0$ and
thus $m(\infty)\left([(x_1 - \Delta + \varepsilon, x_1) \cup (x_1, x_1 + \Delta - \varepsilon)] \cap [0, 1]\right) = 0$ and the restriction of
$m(\infty)$ to $(x_1 - \Delta + \varepsilon, x_1 + \Delta - \varepsilon) \cap [0, 1]$ is a Dirac mass at $x_1$. Apply the same
reasoning to the complement of $(x_1 - \Delta + \varepsilon, x_1 + \Delta - \varepsilon)$; this shows recursively
that $m(\infty)$ is a finite sum of Dirac masses, i.e. $m(\infty) = \sum_{i=1}^{J} \alpha_i \delta_{x_i}$ for some $J \in \mathbb{N}$,
$\alpha_i > 0$, $\sum_{i=1}^{J} \alpha_i = 1$ and $x_i \in [0, 1]$.



Assume that $|x_i - x_j| < \Delta$ for some $i \ne j$. Apply Eq. (C.5) with $\varepsilon = \frac{\Delta - |x_i - x_j|}{2}$.
The left-hand side of Eq. (C.5) is lower bounded by $\alpha_i \alpha_j (x_i - x_j)^2 > 0$, which is a
contradiction. Therefore $|x_i - x_j| \ge \Delta$ for all $i \ne j$.

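C.12. Proof of Proposition 5.7

First we show that if $\nu \in P_{n+1}(\mu_0)$ then there exists some $\nu' \in P_n(\mu_0)$ with $\langle h, \nu'\rangle \le
\langle h, \nu\rangle$, which will clearly show the proposition.

We are given $\nu = \sum_{i=1}^{n+1} \alpha_i \delta_{x_i} \in P_{n+1}(\mu_0)$. Let $x'_n = \frac{\alpha_n x_n + \alpha_{n+1} x_{n+1}}{\alpha_n + \alpha_{n+1}}$ and

$$\nu' = \sum_{i=1}^{n-1} \alpha_i \delta_{x_i} + (\alpha_n + \alpha_{n+1})\,\delta_{x'_n}.$$

We have $\nu' \in P_n(\mu_0)$ and by convexity of $h$:

$$(\alpha_n + \alpha_{n+1})\,h(x'_n) \le \alpha_n h(x_n) + \alpha_{n+1} h(x_{n+1}),$$

thus $\langle h, \nu'\rangle \le \langle h, \nu\rangle$ as required.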
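The merging step can be illustrated on a concrete measure. The following sketch represents a finite sum of Dirac masses as a list of (weight, position) pairs; the helper names and the example measure are hypothetical. It merges the last two atoms at their barycenter and compares the mean and $\langle h, \cdot\rangle$ before and after for the convex function $h(x) = x^2$.

```python
def merge_last_two(atoms):
    """Merge the last two (weight, position) atoms into one atom at their barycenter."""
    *head, (a_n, x_n), (a_next, x_next) = atoms
    mass = a_n + a_next
    return head + [(mass, (a_n * x_n + a_next * x_next) / mass)]


def integrate(h, atoms):
    """Integral of h against the measure sum_i a_i * delta_{x_i}."""
    return sum(a * h(x) for a, x in atoms)


# Hypothetical three-atom measure with total mass 1.
nu = [(0.2, 0.1), (0.5, 0.45), (0.3, 0.9)]
nu_prime = merge_last_two(nu)


def square(x):
    return x * x


def identity(x):
    return x


print(integrate(identity, nu), integrate(identity, nu_prime))   # means agree
print(integrate(square, nu), integrate(square, nu_prime))       # second value is smaller
```

The mean is unchanged because the merged atom sits at the weighted average of the two atoms it replaces, and the inequality for $h$ is Jensen's inequality applied to those two atoms.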

C.13. Proof of Theorem [0| 



By hypothesis $\langle h, m_0\rangle < q$ and, since $h$ is continuous, by Theorem 5.5 $\langle h, m(\infty)\rangle < q$.
Since the mean of $m(\infty)$ is also $\mu_0$ (again by Theorem 5.5 applied to $h(x) = x$),
it follows that $q$ is not in $Q_c(h, \mu_0)$. Together with the hypothesis $q \in Q_n(h, \mu_0)$,
Proposition 5.7 implies that $c < n$.



C.14. Proof of Proposition 5.11

Let $m'(t)$ be the image measure of $m(t)$ by $x \mapsto 1 - x$. By direct computation and
the alternative form of Problem 1, it follows that $m'(t)$ is a solution to Problem 1
with initial condition $m'(0) = m(0)$. By uniqueness, $m'(t) = m(t)$.

C.15. Proof of Proposition \ 5. Ig| 

Let $\nu$ be a symmetric partial consensus with $n + 1$ components. We do as in
the proof of Proposition 5.7. If $n + 1$ is even, we replace the two middle components
by their weighted average. If $n + 1$ is odd, we replace the three middle
components $x_{m-1}, x_m = 0.5, x_{m+1}$ (with $m = n/2 + 1$) by two components
$(\alpha_{m-1} x_{m-1} + 0.5\alpha_m x_m)/(\alpha_{m-1} + 0.5\alpha_m)$ and $(0.5\alpha_m x_m + \alpha_{m+1} x_{m+1})/(0.5\alpha_m +
\alpha_{m+1})$ with weights $\alpha_{m-1} + 0.5\alpha_m$ and $0.5\alpha_m + \alpha_{m+1}$. We obtain some $\nu' \in SP_n$
and $\langle h, \nu'\rangle \le \langle h, \nu\rangle$ for any convex $h$, thus if $q \in SQ_n(h)$ we must also have
$q \in SQ_{n+1}(h)$.



C.16. Proof of Theorem 

The proof is similar to Theorem [57 



C.17. Proof of Theorem 

Assuming that the initial condition $m_0$ is absolutely continuous, the fact that $m(t)$ is
absolutely continuous can be proved by probabilistic arguments which use representations by
inhomogeneous Markov processes with uniformly bounded jump rates.

More precisely, the proof of Theorem 2.1 in Desvillettes et al. for a class of
equations (the generalized cutoff Kac equation) with the same probabilistic structure
as ours extends immediately to the present situation. It is an extension of
Theorem 4.2 proved using only its hypotheses.

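If $m = (m(t), t \in \mathbb{R}_+)$ is a solution of Problem 1 and $m(t)(dx) = f(x, t)\,dx$ then,
for any bounded $h$, an elementary change of variables yields

$$\begin{aligned}
\int h(x) f(x, t)\,dx - \int h(x) f(x, 0)\,dx
&= 2\int_0^t\!\!\iint h(wx + (1-w)y)\,1_{\{|x-y| \le \Delta\}}\,f(x, s) f(y, s)\,dx\,dy\,ds \\
&\quad - 2\int_0^t\!\!\iint h(x)\,1_{\{|x-y| \le \Delta\}}\,f(x, s) f(y, s)\,dx\,dy\,ds \\
&= \frac{2}{w}\int_0^t\!\!\int h(x') \int_{x' - w\Delta}^{x' + w\Delta} f\!\left(\frac{x' - (1-w)y}{w}, s\right) f(y, s)\,dy\,dx'\,ds \\
&\quad - 2\int_0^t\!\!\int h(x) f(x, s) \int_{x - \Delta}^{x + \Delta} f(y, s)\,dy\,dx\,ds
\end{aligned}$$

from which (6.1) readily follows.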



The converse statement follows by integrating Eq. (6.1) against $h(x)\,dx$, which after
the reverse change of variables yields Problem 1 as a weak formulation.

Eq. (6.2) is obtained similarly using the change of variables $x' = \frac{wy - (1-w)x}{2w - 1}$
and $y' = \frac{wx - (1-w)y}{2w - 1}$.

C.18. Proof of Proposition 6.2 



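Since $f(x, t)$ is non-negative, we have:

$$\frac{\partial f(x, t)}{\partial t} \le \frac{2}{w}\int_{x - w\Delta}^{x + w\Delta} f(y, t)\,f\!\left(\frac{x - (1-w)y}{w}, t\right) dy.$$

For a fixed arbitrary $t$, let $A_i = \{x \in \mathrm{Supp}(f(\cdot, t)) \mid i - 1 < f(x, t) \le i\}$, $i \ge 1$,
be the level sets. Note that $A_j = \emptyset$ for all $j > \lceil M(t) \rceil$ and that the $A_i$ are disjoint.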
For any $x$, we have that:



r 4 /(M)/(^*, ( i* 

Jx-wA \ w 



E< 



, a; — (1 — w)y . i \ 
y G Ai, ^ — G Aj } ) max^,.?} 2 . 



Using the fact that the Ai are disjoint we can get that: 



= ?>({' 



2 



a; — (1 — w)y 



w 



e A 3 \) max {i, j} 2 



E^ 



y g Ai, 



x — (1 — w)y 



g y A fe 



y.u. r ;l ' r:/; .i.lV - ^ 



We can bound I 1 and I 2 now as: 



to ^ 1 — w t-r 1 



subject to the following restrictions: 



V fi{Ai) < 1, YVi - I)m(^) < / f{x, t)dx = 1. 
Plugging the second restriction into the bound of I\ and 7 2 , we get that: 



[M(t)l 



[M(i)l-1 



E M^). a < + E M*) ( 

1=1 1 v / 1 2—1 N 



\M(t)V 



^— ^ [M(t)]-i)(i- fM(t)l). 



rM(*)i-i [M(t)i 

The maximum of the RHS is attained when fi(Ai) = V i > 1 and /Lt(Ai) is as big 
as possible. By the first restriction, fi{Ai) = 1. In that case, we have that: 

E /^)* 2 ^ rM(t)i -i + 1 - wm + 3 - MW + 4 ' 



Therefore: 



sup|^ M (A,)« 2 | < M(t)+4. 
Finally, for any $x$ we have:
2 - r A f(y, 0/ (^=^>, t) dy<h+h<(- + — I (M( t ) + 4). 

W Jx-wA V w J \ w l-W.' 



which means that: 



.!/'(/), (l + ^_](M(t)+4). 
to 1 — to 



Integrating, we get the result. 



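C.19. Proof of Proposition 6.3

Again, since $f(x, t)$ is non-negative, for all $x$,

$$\left|\frac{\partial f}{\partial t}(x, t)\right| \le \max\left\{\frac{2}{w}\int_{x - w\Delta}^{x + w\Delta} f(y, t)\,f\!\left(\frac{x - (1-w)y}{w}, t\right) dy,\ \ 2f(x, t)\int_{x - \Delta}^{x + \Delta} f(y, t)\,dy\right\}.$$

On the one hand,

$$2f(x, t)\int_{x - \Delta}^{x + \Delta} f(y, t)\,dy \le 2M(t)\int_0^1 f(y, t)\,dy \le 2M(t),$$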



on the other, using Proposition |6.2[ 



therefore 



<9/ 



/(•,*) 



I — + t— — ) (M(t) + 4). 
w 1 — w ' 



C.20. Proof of Proposition B.1



Let $f^r(x, t)$ be defined piecewise in the intervals $X_i = [x_i, x_{i+1}]$ and let $M_i$ be the
constant chosen for the piecewise constant approximation on the interval $X_i$. We
have that, independently of $t$:

$$\int_0^1 f^r(x, t)\,dx = \int_0^1 \sum_{i=1}^{I} M_i\,1_{X_i}(x)\,dx = \sum_{i=1}^{I} \int_{X_i} \frac{\int_{x_i}^{x_{i+1}} f^e(y, t)\,dy}{x_{i+1} - x_i}\,dx = \int_0^1 f^e(y, t)\,dy.$$
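Mass conservation under interval averaging can be illustrated on a small example. This sketch makes a simplifying assumption that is not taken from the source: the function being approximated is piecewise linear with breakpoints exactly on the grid, so its average on each interval is the mean of its two endpoint values. The grid, the node values and the function names are hypothetical.

```python
def piecewise_linear_mass(xs, ys):
    """Exact integral of the piecewise linear function through the points (xs[i], ys[i])."""
    return sum(0.5 * (ys[i] + ys[i + 1]) * (xs[i + 1] - xs[i])
               for i in range(len(xs) - 1))


def interval_averages(xs, ys):
    """Average of that function on each interval [xs[i], xs[i+1]] (the constants M_i)."""
    return [0.5 * (ys[i] + ys[i + 1]) for i in range(len(xs) - 1)]


def piecewise_constant_mass(xs, constants):
    """Integral of the piecewise constant function equal to constants[i] on interval i."""
    return sum(c * (xs[i + 1] - xs[i]) for i, c in enumerate(constants))


# Hypothetical grid on [0, 1] and node values.
grid = [0.0, 0.1, 0.25, 0.5, 0.8, 1.0]
values = [0.2, 1.5, 0.7, 1.9, 0.4, 0.0]

averages = interval_averages(grid, values)
print(piecewise_linear_mass(grid, values), piecewise_constant_mass(grid, averages))
print(max(values), max(averages))   # the maximum does not increase
```

The second printed line reflects the remark used in the proof of Proposition B.2: replacing a function by its interval averages cannot raise its maximum.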
C.21. Proof of Proposition B.2



Keeping in mind that for any interval, the slope of f e (x,kAt) is bounded by 

" >.( fe - 



2A WWA fe yielding: 



e c . s .(I) < J Max - 2 Sl0pe 0^ = At\d t f(x, (k - 1)A*)U. (C6) 
On the other hand: 

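$$M(\Delta t) = |f^r(x, \Delta t)|_\infty \le |f^e(x, \Delta t)|_\infty \le |f^r(x, 0)|_\infty + \Delta t\,|\partial_t f^r(x, 0)|_\infty \le M + \Delta t K_1 M + \Delta t K_2 = (1 + \Delta t K_1)M + \Delta t K_2,$$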
where K\ = — + t^— , K 2 = — + t^— ■ The first inequality is true because when we 

w 1 — w ' * w 1—w ^ J 

approximate by piecewise constant splines, the maximum of the function decreases
and the third is true by Proposition 6.2. Note that in order to be able to apply it we
are implicitly using Proposition B.1, as the total mass is conserved. By induction:

$$M\left(\frac{T}{\Delta t}\Delta t\right) \le (1 + \Delta t K_1)^{T/\Delta t} M + \Delta t K_2 \sum_{i=0}^{T/\Delta t - 1} (1 + \Delta t K_1)^i = (1 + \Delta t K_1)^{T/\Delta t} M + \frac{K_2}{K_1}\left((1 + \Delta t K_1)^{T/\Delta t} - 1\right) \le (1 + \Delta t K_1)^{T/\Delta t}\left(M + \frac{K_2}{K_1}\right).$$

We can now bound $M(k\Delta t)$ in the following way. As $K_1$ and $K_2$ are positive, taking
into account that $(1 + K_1 \Delta t)^{T/\Delta t}$ is decreasing with $\Delta t$, we have for any $k$:

$$M(k\Delta t) \le (1 + \Delta t K_1)^k M + \frac{K_2}{K_1}(1 + \Delta t K_1)^k \le e^{K_1 T}\left(M + \frac{K_2}{K_1}\right).$$

Using Proposition 6.3,

$$|\partial_t f^r(x, (k-1)\Delta t)|_\infty \le K_1 M((k-1)\Delta t) + K_2 \le K_1 e^{K_1 T}\left(M + \frac{K_2}{K_1}\right) + K_2 = c.$$

Combining this equation with equation (C.6) we get the desired result.
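The induction can be checked by iterating the recursion directly. The sketch below iterates $M_{k+1} = (1 + \Delta t K_1) M_k + \Delta t K_2$, compares the result with the closed form obtained from the geometric sum, and compares both with the bound $e^{K_1 T}(M + K_2/K_1)$. The numerical values of $K_1$, $K_2$, $M$ and $T$ are hypothetical placeholders, not the constants of the paper.

```python
import math


def iterate_recursion(m0, k1, k2, dt, horizon):
    """Iterate m <- (1 + dt*k1)*m + dt*k2 for round(horizon/dt) steps."""
    m = m0
    for _ in range(round(horizon / dt)):
        m = (1 + dt * k1) * m + dt * k2
    return m


def closed_form(m0, k1, k2, dt, horizon):
    """(1 + dt*k1)^n * m0 + (k2/k1) * ((1 + dt*k1)^n - 1) with n = round(horizon/dt)."""
    growth = (1 + dt * k1) ** round(horizon / dt)
    return growth * m0 + (k2 / k1) * (growth - 1)


def uniform_bound(m0, k1, k2, horizon):
    """exp(k1*horizon) * (m0 + k2/k1), which does not depend on dt."""
    return math.exp(k1 * horizon) * (m0 + k2 / k1)


M0, K1, K2, T = 1.0, 3.0, 12.0, 1.0   # hypothetical values for illustration
for dt in (0.1, 0.01, 0.001):
    print(dt, iterate_recursion(M0, K1, K2, dt, T),
          closed_form(M0, K1, K2, dt, T), uniform_bound(M0, K1, K2, T))
```

As $\Delta t$ decreases, the iterated value increases towards $e^{K_1 T}(M + K_2/K_1) - K_2/K_1$ and stays below the uniform bound.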

C.22. Proof of Proposition [iO| 

We have that: 

kAt/j\ ..kAt 



e 



\g ik ~ 1)At (x, kAt) - g (k - 1)At (x, (k - l)At) - Atd t g {k - 1)At {x, {k - l)At)\dx 



o 



< ^(At) 2 !^^- 1 ^*^, (k - l)At)U + O ((At) 3 ) 



By Corollary |6.4[ we can bound, for any k: 

\d 2 tt g {k ^ At {x, {k-l)At)U < lQA\d t g^ At {x, (k-l)At)Ug^ At (x, (k-l)At)\ c 
therefore:



Co 



K 2 e 



,KiT 



M 



K2 



= C 2 



Sen < y (A*) 2 + °(( A< ) 3 ) = °(( At ) 2 )- 



(C.7) 



C.23. Proof of Proposition fB^| 



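$$\frac{d}{dt}\int_0^1 |g^{k\Delta t}(x, t) - g^{(k-1)\Delta t}(x, t)|\,dx \le \int_0^1 |\partial_t g^{k\Delta t}(x, t) - \partial_t g^{(k-1)\Delta t}(x, t)|\,dx$$

$$\le 2\int_0^1 \left| g^{k\Delta t}(x, t)\int_{x - \Delta}^{x + \Delta} g^{k\Delta t}(y, t)\,dy - g^{(k-1)\Delta t}(x, t)\int_{x - \Delta}^{x + \Delta} g^{(k-1)\Delta t}(y, t)\,dy \right| dx$$

$$+ \frac{2}{w}\int_0^1 \left| \int_{x - w\Delta}^{x + w\Delta} \left[ g^{k\Delta t}(y, t)\,g^{k\Delta t}\!\left(\tfrac{x - (1-w)y}{w}, t\right) - g^{(k-1)\Delta t}(y, t)\,g^{(k-1)\Delta t}\!\left(\tfrac{x - (1-w)y}{w}, t\right) \right] dy \right| dx = I + J.$$

We will first bound $I$. We have that:

$$I \le 2\int_0^1\!\!\int_{x - \Delta}^{x + \Delta} |g^{(k-1)\Delta t}(x, t) - g^{k\Delta t}(x, t)|\,g^{(k-1)\Delta t}(y, t)\,dy\,dx + 2\int_0^1\!\!\int_{x - \Delta}^{x + \Delta} g^{k\Delta t}(x, t)\,|g^{(k-1)\Delta t}(y, t) - g^{k\Delta t}(y, t)|\,dy\,dx = I_1 + I_2.$$

On the one hand:

$$I_1 \le 2\int_0^1 |g^{(k-1)\Delta t}(x, t) - g^{k\Delta t}(x, t)|\,dx,$$

on the other:

$$I_2 \le 2\int_0^1\!\!\int_0^1 g^{k\Delta t}(x, t)\,|g^{(k-1)\Delta t}(y, t) - g^{k\Delta t}(y, t)|\,dy\,dx \le 2\int_0^1 |g^{(k-1)\Delta t}(y, t) - g^{k\Delta t}(y, t)|\,dy.$$

Now we will bound $J$:

$$J \le \frac{2}{w}\int_0^1\!\!\int_{x - w\Delta}^{x + w\Delta} g^{k\Delta t}(y, t)\left| g^{k\Delta t}\!\left(\tfrac{x - (1-w)y}{w}, t\right) - g^{(k-1)\Delta t}\!\left(\tfrac{x - (1-w)y}{w}, t\right) \right| dy\,dx$$

$$+ \frac{2}{w}\int_0^1\!\!\int_{x - w\Delta}^{x + w\Delta} |g^{k\Delta t}(y, t) - g^{(k-1)\Delta t}(y, t)|\;g^{(k-1)\Delta t}\!\left(\tfrac{x - (1-w)y}{w}, t\right) dy\,dx = J_1 + J_2.$$

After the change of variables $x \mapsto \frac{x - (1-w)y}{w}$ and a renaming of the variables,

$$J_1 = 2\int_0^1\!\!\int_{x - \Delta}^{x + \Delta} g^{k\Delta t}(x, t)\,|g^{k\Delta t}(y, t) - g^{(k-1)\Delta t}(y, t)|\,dy\,dx \le 2\int_0^1 |g^{k\Delta t}(x, t) - g^{(k-1)\Delta t}(x, t)|\,dx,$$

$$J_2 = 2\int_0^1\!\!\int_{x - \Delta}^{x + \Delta} |g^{k\Delta t}(x, t) - g^{(k-1)\Delta t}(x, t)|\,g^{(k-1)\Delta t}(y, t)\,dy\,dx \le 2\int_0^1 |g^{k\Delta t}(x, t) - g^{(k-1)\Delta t}(x, t)|\,dx.$$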

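Adding all the equations together we get that:

$$\frac{d}{dt}\int_0^1 |g^{k\Delta t}(x, t) - g^{(k-1)\Delta t}(x, t)|\,dx \le I + J \le I_1 + I_2 + J_1 + J_2 \le 8\int_0^1 |g^{k\Delta t}(x, t) - g^{(k-1)\Delta t}(x, t)|\,dx.$$

Integrating:

$$\left|\mu^{t}_{k\Delta t}(dx) - \mu^{t}_{(k-1)\Delta t}(dx)\right| = \int_0^1 |g^{k\Delta t}(x, t) - g^{(k-1)\Delta t}(x, t)|\,dx \le e^{8(t - k\Delta t)}\int_0^1 |g^{k\Delta t}(x, k\Delta t) - g^{(k-1)\Delta t}(x, k\Delta t)|\,dx = e^{8(t - k\Delta t)}\left|\mu^{k\Delta t}_{k\Delta t}(dx) - \mu^{k\Delta t}_{(k-1)\Delta t}(dx)\right|,$$

as we wanted to prove.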



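C.24. Proof of Theorem B.5

$$e_{tot} \le \sum_{k=1}^{T/\Delta t} \left|\mu^{T}_{(k-1)\Delta t}(dx) - \mu^{T}_{k\Delta t}(dx)\right| \le e^{8T}\sum_{k=1}^{T/\Delta t} \left|\mu^{k\Delta t}_{(k-1)\Delta t}(dx) - \mu^{k\Delta t}_{k\Delta t}(dx)\right| \le \frac{T e^{8T}}{\Delta t}\left(c\,\Delta t + O((\Delta t)^2)\right) = C + O(\Delta t).$$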